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Notation 


Z(N) = integers mod N = Z/NZ. 

If A is an abelian group, we usually denoted by Ay the elements x € A 
such that Nx = 0. Thus for a prime p, we denote by A, the elements of order 
p. However, we also use p in this position for indexing purposes, so we rely 
to some extent on the context to make the intent clear. In his book, Shimura 
uses A[p] for the kernel of p, and more generally, if A is a module 
over a ring, uses A[a] for the kernel of an ideal a in A. The brackets are 
used also in other contexts, like operators, as in Lubin—Tate theory. There is 
a dearth of symbols and positions, so some duplication is hard to avoid. 

We let A(N) = A/NA. We let A® be the subgroup of A consisting of all 
elements annihilated by a power of p. 
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Introduction 


Kummer’s work on cyclotomic fields paved the way for the development of 
algebraic number theory in general by Dedekind, Weber, Hensel, Hilbert, 
Takagi, Artin and others. However, the success of this general theory has 
tended to obscure special facts proved by Kummer about cyclotomic fields 
which lie deeper than the general theory. For a long period in the 20th century 
this aspect of Kummer’s work seems to have been largely forgotten, except 
for a few papers, among which are those by Pollaczek [Po], Artin—Hasse 
[A—H] and Vandiver [Va]. 

In the mid 1950’s, the theory of cyclotomic fieids was taken up again by 
Iwasawa and Leopoldt. Iwasawa viewed cyclotomic fields as being analogues 
for number fields of the constant field extensions of algebraic geometry, and 
wrote a great sequence of papers investigating towers of cyclotomic fields, 
and more generally, Galois extensions of number fields whose Galois group 
is isomorphic to the additive group of p-adic integers. Leopoldt concentrated 
on a fixed cyclotomic field, and established various p-adic analogues of the 
classical complex analytic class number formulas. In particular, this led him 
to introduce, with Kubota, p-adic analogues of the complex L-functions 
attached to cyclotomic extensions of the rationals. Finally, in the late 1960’s, 
Iwasawa [Iw 11] made the fundamental discovery that there was a close 
connection between his work on towers of cyclotomic fields and these p-adic 
L-functions of Leopoldt—Kubota. 

The classical results of Kummer, Stickelberger, and the Iwasawa—Leopoldt 
theories have been complemented by, and received new significance from the 
following directions: 


1. The analogues for abelian extensions of imaginary quadratic fields in 
the context of complex multiplication by Novikov, Robert, and Coates— 
Wiles. Especially the latter, leading to a major result in the direction of the 
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Birch—Swinnerton-Dyer conjecture, new insight into the explicit reciprocity 
laws, and a refinement of the Kummer—Takagi theory of units to all levels. 

2. The development by Coates, Coates—Sinnott and Lichtenbaum of an 
analogous theory in the context of K-theory. 

3. The development by Kubert—Lang of an analogous theory for the units 
and cuspidal divisor class group of the modular function field. 

4. The introduction of modular forms by Ribet in proving the converse of 
Herbrand’s theorem. The connection between cyclotomic theory and modular 
forms reached a culmination in the work of Mazur—Wiles, who proved the 
“main conjecture”. This is one of the greatest achievements of the modern 
period of mathematics. 

5. The connection between values of zeta functions at negative integers 
and the constant terms of modular forms starting with Klingen and Siegel, 
and highly developed to congruence properties of these constant terms by 
Serre, for instance, leading to the existence of the p-adic L-function for 
arbitrary totally real fields. 

6. The construction of p-adic zeta functions in various contexts of elliptic 
curves and modular forms by Katz, Manin, Mazur, Vishik. 

7. The connection with rings of endomorphisms of abelian varieties or 
curves, involving complex multiplication (Shimura—Taniyama) and/or the 
Fermat curve (Davenport—Hasse—Weil and more recently Gross—Rohrlich). 


My two volumes on Cyclotomic Fields provided a systematic introduction 
to the basic theory. No such introduction existed when they first came out. 
Since then, Washington’s book has appeared, covering some of the material 
but emphasizing different things. As my books went out of print, Springer- 
Verlag and I decided to continue making them available in a single volume 
for the convenience of readers. No changes have been made except for some 
corrections, for which I am indebted to Larry Washington, Neal Koblitz, and 
others. Thus the book is kept essentially purely cyclotomic, and as elementary 
as possible, although in a couple of places we use class field theory. No 
connection is made with modular forms. This would require an entire book 
by itself. However, in a major development, a purely cyclotomic proof of the 
“main conjecture”, the Mazur—Wiles theorem, has been found, and I am very 
much indebted to Karl Rubin for having given me an appendix containing 
a self-contained proof, based on work of Thaine, Kolyvagin and Rubin himself. 
For details of the history, see Rubin’s own introduction to his appendix. 

My survey article [L 5] provides another type of introduction to 
cyclotomic theory. First, at the beginning in §2 it gives a quick and 
efficient summary of main results, stripped of their proofs which neces- 
sarily add bulk. Second, this article is also useful to get a perspective on 
cyclotomic fields in connection with other topics, for instance having to 
do with modular curves and elliptic curves. In that survey, I emphasize 
questions about class groups and unit groups in a broader context than 
cyclotomic fields. Specifically, in Theorem 4.2 of [L 5] I state how Mazur-— 
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Wiles construct certain class fields (abelian unramified extensions) of 
cyclotomic fields by means of torsion points on the Jacobians of modular 
curves. The existence of class fields of certain degrees is predicted ab- 
stractly by the pure cyclotomic theory, but the explicit description of the 
irrationalities generating such class fields provides an additional basic 
structure. In that sense, the purely cyclotomic proof of the “main con- 
jecture”, and even the “main conjecture” itself, do not supersede and are 
not substitutes for the Mazur—Wiles theory. 

The first seven chapters of the present book, together with Chapters 
10, 11, 12 and 13 and Rubin’s appendix develop systematically the basic 
structure of units and ideal class groups in cyclotomic fields, or possibly 
Galois extensions whose Galois group is isomorphic to the group of 
p-adic integers. We look at the ideal class group in fields such as Q(p,») 
where p,, is the group of p"-th roots of unity. We decompose these 
groups, as well as their projective limits, into eigenspaces for characters of 
(Z/pZ)*, and we attempt to describe as precisely as possible the structure 
of these eigenspaces. For instance, let h, denote the class number of Q(n,). 
There is already a natural decomposition h,=hjh,, where h; is the 
order of the (+1)-eigenspace, and h, is the order of the (—1)-eigenspace 
for complex conjugation, and similarly for p” instead of p. Part of the 
problem is to determine as accurately as possible the p-divisibility of h; 
and h,, and also asymptotically for p” instead of p when n> co. 

A number of chapters are logically independent of each other. For instance, 
readers might want to read Chapter 10 on measures and Iwasawa power series 
immediately after Chapter 4, since the ideas of Chapter 10 are continuations 
of those of Chapter 4. This leads naturally into the Ferrero—Washington 
theorems, proving Iwasawa’s conjecture that the p-primary part of the ideal 
class group in the cyclotomic Z,,-extension of a cyclotomic field grows linearly 
rather than exponentially. This is first done for the minus part (the minus 
referring, as usual, to the eigenspace for complex conjugation), and then it 
follows for the plus part because of results bounding the plus part in 
terms of the minus part. Kummer had already proved such results. An- 
other proof for the Ferrero-Washington theorem was subsequently given 
by Sinnott [Sin 2]. 

The first seven chapters suffice for the proof of the “main conjecture” 
in Rubin’s appendix, which does not use the Ferrero—Washington theorem. 
However, using that theorem in addition gives a clearer picture of the 
projective limit of the ideal class groups as module over the projective 
limit of the group rings Z,[G,], where G, is the Galois group of Q(n,») 
over Q(u,), and therefore also as module over Z,. This module plays a 
role analogous to the Jacobian in the theory of curves. The Ferrero— 
Washington theorem states that up to a finite torsion group, this module 
is free of finite rank over Z,. The “main conjecture” gives some descrip- 
tion of the characteristic polynomial of a generator for the Galois group 
playing an analogous role to the Frobenius endomorphism in the theory 
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of curves. Questions then arise whether these characteristic polynomials 
behave in ways similar to those in the theory of curves over finite fields. 
These questions pertain both to the nature of these polynomials, e.g. 
their coefficients and their roots (Riemann type hypotheses); and also 
concerning the behavior of these polynomials for varying p. Cf. [L 5], 
p. 274. 

After dealing mostly with ideal class groups and units, we turn to a 
more systematic study of Gauss sums. We do what amounts to “Dwork 
theory”, to derive the Gross—Koblitz formula expressing Gauss sums in 
terms of the p-adic gamma function. This lifts Stickelberger’s theorem 
p-adically. Half of the proof relies on a course of Katz, who had first 
obtained Gauss sums as limits of certain factorials, and thought of using 
Washnitzer-Monsky cohomology to prove the Gross—Koblitz formula. 

Finally, we apply these latter results to the Ferrero—Greenberg theorem, 
showing that L,(0, y) #0 under the appropriate conditions. We take this 
opportunity to introduce a technique of Washington, who defined the p-adic 
analogues of the Hurwitz partial zeta functions, in a way making it possible 
to parallel the treatment from the complex case to the p-adic case, but in a 
much more efficient way. 

Some basic conjectures remain open, notably the Kummer-—Vandiver 
conjecture that h} is prime to p. The history of that conjecture is inter- 
esting. Kummer made it in no uncertain terms in a letter to Kronecker 
dated 28 December 1849. Kummer first tells Kronecker off for not under- 
standing properly what he had previously written about cyclotomic fields 
and Fermat’s equation, by stating “so liegt hierin ein grosser Irrthum 
deinerseits ...”; and then he goes on (Collected Works, Vol. 1, p. 84): 


Deine auf dieser falschen Ansicht beriihenden Folgerungen fallen somit von 
selbst weg. Ich gedenke vielmehr den Beweis des Fermatschen Satzes auf 
folgendes zu grunden: 

1. Auf den noch zu beweisenden Satz, dass es fiir die Ausnahmszahlen / stets 
Einheiten giebt, welche ganzen Zahlen congruent sind fiir den Modul 4, 
ohne darum Ate Potenzen anderer Einheiten zu sein, oder was dasselbe ist, 
dass hier niemals D/A durch 4 theilbar wird. 


In our notation: 4 = p and D/A =h}. Kummer wrote D/A as a quo- 
tient of regulators, expressing the index of the cyclotomic units in the 
group of all units. This index happens to coincide with h} (cf. Theorem 
5.1 of Chapter 3). Thus Kummer rather expected to prove the conjecture. 
According to Barry Mazur, who reviewed Kummer’s complete works 
when they were published by Springer-Verlag, Kummer never mentioned 
the conjecture in a published paper, but he mentioned it once more in 
another letter to Kronecker on 24 April 1853 (loc cit p. 93): 


Hierein hangt auch zusammen, dass eines meiner Haupresultate auf welches 
ich seit einem Vierteljahre gebaut hatte, dass der zweite Faktor der Klassen- 
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zahl D/A niemals durch / theilbar ist, falsch ist oder wenigstens unbewiesen ... 
Ich werde also vorlaufig hauptsachlich meinen Fleiss nur auf die Weiter- 
fiihrung der Theorie der complexen Zahlen wenden, und dann sehen ob etwas 
daraus entsteht, was auch uber jene Aufgabe Licht verbreitet. 


So the situation was less clear than Kummer thought at first. Much later, 
Vandiver made the same conjecture, and wrote [Va 1]: 


... However, about twenty-five years ago I conjectured that this number was never 
divisible by / [referring to h* ]. Later on, when I discovered how closely the question 
was related to Fermat’s Last Theorem, I began to have my doubts, recalling how 
often conjectures concerning the theorem turned out to be incorrect. When I visited 
Furtwangler in Vienna in 1928, he mentioned that he had conjectured the same 
thing before I had brought up any such topic with him. As he had probably more 
experience with algebraic numbers than any mathematician of his generation, I felt 
a little more confident .... 


On the other hand, many years ago, Feit was unable to understand a step 
in Vandiver’s “proof” that p{h* implies the first case of Fermat’s Last 
Theorem, and stimulated by this, Iwasawa found a precise gap which is such 
that there is no proof. 

The Iwasawa—Leopoldt conjecture that the p-primary part of C~ is cyclic 
over the group ring, and is therefore isomorphic to the group ring modulo 
the Stickelberger ideal, also remains open. For prime level, Leopoldt and 
Iwasawa have shown that this is a consequence of the Kummer-Vandiver 
conjecture. Cf. Chapter IV, §4. 

Much of the cyclotomic theory extends to totally real number fields, as 
theorems or conjecturally. We do not touch on this aspect of the question. 
Cf. Coates’ survey paper [Co 3], and especially Shintani [Sh]. 

Coates, Ribet, and Rohrlich had read the original manuscript and had 
made a large number of suggestions for improvement. I thank them again, 
as well as Koblitz and Washington, for their suggestions and corrections. 


New Haven, 1989 SERGE LANG 
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Character Sums 


Character sums occur all over the place in many different roles. In this 
chapter they will be used at once to represent certain principal ideals, thus 
giving rise to annihilators in a group ring for ideal classes in cyclotomic fields. 

They also occur as endomorphisms of abelian varieties, especially Jacob- 
ians, but we essentially do not consider this, except very briefly in §6. They 
occur in the computation of the cuspidal divisor class group on modular 
curves in [KL 6]. The interplay between the algebraic geometry and the 
theory of cyclotomic fields is one of the more fruitful activities at the moment 
in number theory. 


§1. Character Sums Over Finite Fields 
We shall use the following notation. 
F = F, = finite field with g elements, g = p”. 
Z(N) = Z/NZ. 
€ = primitive pth root of unity in characteristic 0. Over the complex 
numbers, ¢ = e27#/?, 
Tr = trace from F to F,. 


Ly = group of Mth roots of unity. 
A: F-> p, the character of F given by 


A x) as gTT), 


yx: F* — y,_, denotes a character of the multiplicative group. 
We extend y to F by defining y(0) = 0. 


The field Q(uiy) has an automorphism o_, such that 


o-4:0R 07. 


1. Character Sums 


If « € Q(uy) then the conjugate & denotes o_,«. Over the complex numbers, 
this is the complex conjugate. 
The Galois group of Q(uy) over Q is isomorphic to Z(N)*, under the map 


Ch> 6, 
where 
Oe FC. 


Let f, g be functions on F with values in a fixed algebraically closed field of 
characteristic 0. We define 


Sf, g) = a S(®)g(x). 
We define the Fourier transform 7f by 
Tf(y) a >, SoM —29) = DSP, 


Then 7f is again a function on F, identified with its character group by A, 
and T is a linear map. 


Theorem 1.1. Let f~ be the function such that f-(x) = f(—x). Then 
T*f = qf~, that is 


T*f(z) = af(-2). 
Proof. We have 


T*f(z) = >, > S@)A(—yx)A(—2y) 
= > fx -— 2) > A—yx). 


If x #0 then y+>A(yx) is a non-trivial character, and the sum of the 
character over F is 0. Hence this last expression is 


= qf(—2z) 
as desired. 


We define the convolution f * g between functions by the formula 
(f* ay) = > fs — »). 


A change of variables shows that 


Seg=arf. 


§1. Character Sums Over Finite Fields 


Theorem 1.2. For functions f, g on F we have 
T(f * g) = (f)(78) 
T(fg) = ; Tf « Te. 
Proof. For the first formula we have 
T(f * 8)(2) = DF * sXM—29) = DD Sade — DA-2). 
We change the order of summation, let tf = y — x, y = x + t, and find 
= 2 S)M—2x) D, OA — 21) 
= (Tf)(Tg)(2), 
thereby proving the first formula. 


The second formula follows from the first because T is an isomorphism 
on the space of functions on F, so that we can write f = Tf, and g = Tg, 
for some functions f,, g;. We then combine the first formula with Theorem 
1.1 to get the second. 


We shall be concerned with the Gauss sums (Lagrange resolvant) 


SQA) = SQ) = > XA) 


u 


where the sum is taken over u € F*. We could also take the sum over x in F 
since we defined y(0) = 0. Since 4 is fixed, we usually omit the reference to A 
in the notation. The Gauss sums have the following properties. 


GS 0. Let x, be the trivial character 1 on F*. Then 


SQ) = —-1. 


This is obvious from our conventions. It illustrates right at the beginning the 
pervasive fact, significant many times later, that the natural object to con- 
sider is — S(y) rather than S(x) itself. We shall also write 


S(1) = SCI, 4), 


but the convention remains in force that even for the trivial character, its 
value at 0 is 0. 


GS 1. For any character y # 1, we have Ty = y(—1)S()x7?- 


1. Character Sums 


Proof. We have 


Tx(y) = >, x)A(—yx). 


x 


If y = 0 then Ty(y) = 0 (summing the multiplicative character over the 
multiplicative group). If y # 0, we make a change of variables x = —ty—}, 
and we find precisely the desired value 


x— )SQOx~?). 
GS 2. We have S(%) = x(—1)S(p and for x # 1, S()S(D = x(—1)q, so 


SOS) = 9, fory #1. 


Proof. Note that T?y = T(y(—VS(~x-) = SCOS(y~y. But we also 
know that T?y = gy~. This proves GS 2, as the other statements are obvious. 


Over the complex numbers, we obtain the absolute value 


ISQ)| = 47”. 


We define the Jacobi sum 


I(x 2) = — > axa — >). 


Observe the minus sign, a most useful convention. We have 
J(1, 1) = —(q — 2). 
GS 3. If 11%. 4 1 then 


Ja 1) = —SQ2SE. 


In particular, J(\, ¥2) = Ja, 1) = 1. Tf x1%2 = 1 but not both x1, x2 
are trivial, then 


IQs %2) = ya(— 1). 


Proof. We compute from the definitions: 
SGa)SGa) = 2 2 tae) A + y) 
= 2 2 ta(X)xalY — x)AC(y) 

=> > u@rlu — MAW) + > a)x2(—-»). 


x u#0 x 


§1. Character Sums Over Finite Fields 


If y:72 # 1, the last sum on the right is equal to 0. In the other sum, we inter- 
change the order of summation, replace x by ux, and find 


> uxaWAU) > x)x2(1 — x), 


thus proving the first assertion of GS 3. If y,7. = 1, then the last sum on the 
right is equal to y,(—1)(q — 1), and the second assertion follows from 
GS 2. 


Next we give formulas showing how the Gauss sums transform under 
Galois automorphisms. 


GS 4. S(x?) = S(x). 
Proof. Raising to the pth power is an automorphism of F, and therefore 
Tr(x”) = Tr(x). 


Thus S(z?) is obtained from S(y) by permuting the elements of F under 
xt>x?, The property is then obvious. 


Let m be a positive integer dividing gq — 1, and suppose that y has order m, 
meaning that 


yr = 1. 
Then the values of y are in Q(u,,) and 


Sq) = SQ; A) € QU; Hp)- 


For any integer c prime to m we have an automorphism o,,, of Q(Um, Hp) 
such that 


Oo1:€t> Ce and a,,, is identity on p,. 
For any integer v prime to p, we have an automorphism oj,, such that 
O,,,:e+>e" and o,,, is identity on pn. 


We can select v in a given residue class mod p such that v is also prime to m. 
In the sequel we usually assume tacitly that v has been so chosen, in particular 
in the next property. 


GS 5. Fei S(X) = SX) and 41,,S(x~) = X)SCO) 


Proof. The first is obvious from the definitions, and the second comes by 
making a change of variable in the Gauss sum, 


xrevolx, 


1. Character Sums 


Observe that 
01, A(x) = gV Tr) — gTr(vx) — A(vx). 


The second property then drops out. 
The diagram of fields is as follows. 


Q(ums Hp) 


Q 
From the action of the Galois group, we can see that the Gauss sum 
(Lagrange resolvant) satisfies a Kummer equation. 


Qn) Q(u,) 


Theorem 1.3. Assume that y has order m. 


(i) S(y)" Hes in Q(Um). 
(ii) Let b be an integer prime to m, and let o, = oy,,. Then S(x)°~% lies in 


Q(Um)- 


Proof. In each case we operate on the given expression by an automorphism 
o,,, with an integer v prime to pm. Using GS 5, it is then obvious that the 
given expression is fixed under such an automorphism, and hence lies in 


QUn)- 


§2. Stickelberger’s Theorem 


In the first section, we determined the absolute value of the Gauss sum. 
Here, we determine the prime factorization. We shall first express a character 
in terms of a canonical character determined by a prime. 

Let p be a prime ideal in Q(u,_,), lying above the prime number p. The 
residue class field of is identified with F = F,. We keep the same notation 
as in §1. The equation X4-! — 1 = 0 has distinct roots mod p, and hence 
reduction mod p induces an isomorphism 


lg-1 > F* = F,*. 


Phrased another way, this means that there exists a unique character w of 
F* such that 


@(u) mod p = u. 


This character will be called the Teichmuller character. This last equation 
will also be written in the more usual form 


@(u) = u (mod p). 


§2. Stickelberger’s Theorem 


The Teichmuller character generates the character group of F*, so any 
character x is an integral power of w. 
We let 


=e-—l. 


Let % be a prime ideal lying above p in Q(u,-1, Mp). We use the symbol 
A ~ Bto mean that A/B is a unit, or the unit ideal, depending whether A, B 
are algebraic numbers or (fractional) ideals. We then have 


p~ prt 


because elementary algebraic number theory shows that p is totally ramified 
in Q(e), and p is totally ramified in Q(u,-1, Hp): 

Let k be an integer, and assume first that 0 < k <q — 1. Write the 
p-adic expansion 


k = ko + kip teeet ky-1p""1 


with 0 < kj < p — 1. We define 
s(k) = ko + ky +--+ + Kn-i. 


For an arbitrary integer k, we define s(k) to be periodic mod q — 1, and 
defined by the above sum in the range first assumed. For convenience, we also 
define 


y(k) = ko! ki!++ + Kn-a! 


to be the product of the k;,! in the first range, and then also define y(k) by 
(q — 1)-periodicity for arbitrary integers k. If the dependence on q is 
desired, one could write 


Sak) and »,(k). 
Theorem 2.1. For any integer k, we have the congruence 


S(@-*,e™) —1 


In particular, 
ordy S(w~*) = s(k). 


Remark. Once more, we see how much more natural the negative of the 
Gauss sum turns out to be, for we have 


—S(@-*, 4 1 
Se = mes 3B) 


with 1 instead of —1 on the right-hand side. 


1. Character Sums 


Proof of Theorem 2.1. If k = 0 then the relation of Theorem 2.1 is clear 
because both sides of the congruence to be proved are equal to —1. We 
assume | < k < q — 1, and prove the theorem by induction. Suppose first 
that k = 1. Then 


S(w-*) 22 > @(u)~1gTFO 
= > o)-4(1 + 2) 
= > ow)" + (Tru)n + O(n) 
(interpreting Tr u as an integer in the given residue class mod p). But 


@(u)~* Tru) = uu t+ uP? +-+- + w"*) mod B 
Pt uPot peep yer tnd, 


Each u+> u*’-} is a non-trivial character of F*. Hence 
> ou)-? Tru) = ¢ — 1 = —1 (mod 8) 
and therefore 


S(O") = —1 (mod 8) 


thus proving the theorem for k = 1. 
Assume now the result proved for k — 1, and write 
ak = @-t@-&-Y 
for 1 < k < q — 1. We distinguish two cases. 
Case 1. p|k, so we can write k = pk’ with 1 < k’ < q — 1. Then trivially 
5(k) = s(k’) and (k) = y(k’) 


because k has the same coefficients k, as k’, shifted only by one index. Let 
Op = Gp,1, SO Gy leaves ¢ fixed. Since 


o,S(@-") = S(w-?") = S(w~*), 
we find that applying a, to the inductive congruence 


S@-")_ —-1 


oa _ Wk) (mod $B) 


yields a proof for the present case, because a, is in the decomposition group 
of $, whence o,® = 8. 
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§2. Stickelberger’s Theorem 


Case 2. p+ k. Then 1 < ko. Furthermore, 
s(k) = s(k —1) +1 and yk — 1) = (ko — DIK! + Knead 
Then 


S(@7*) _ S(@7ta@~&- es ie S(@~1) S@-* dD) —1 
—1 —1 


= “NSE DIGS oF (mod ® 


To conclude the proof, it will suffices to get the right congruence for J. We 
use GS 3 from §1, to get: 


—J(o~, @-#-D) = > u-(1 — u)-®-)+4-1 (mod 8), 


and the sum is at first taken for u # 0, 1, but with the additional positive 
exponent g — 1 which does not change anything, we may then suppose that 
the sum is taken for u # 0 in F. Hence we get further 


po x (-1y(? ; “ut 
If 7 4 1 then > u/-1 = 0, so we get the further congruence 

—J(a~*, a) = (—1)(q — Kg — ) = —ko (mod 8), 
thereby proving the theorem. 


Having obtained the order of the Gauss sum at one prime above p, we also 
want the full factorization. Suppose that m is an integer >1 and that p { m. 
Let p be a prime ideal above p in Q(x,,) and let 

Np = g = p". 
Let k be an integer such that 


i has order m in Q/Z. 


Let <t> denote the smallest real number >0 in the residue class mod Z of a 
real number ¢. Let 


G = Gal(Q(Un)/Q). 


Define the Stickelberger element in the rational group ring 


ke 


Hk») = 5 Cos Date QIG} 


1. Character Sums 


Let $ be the prime ideal in Q(u,,, up) lying above p. Let w as before be the 
Teichmuller character on F¥. We let o, = 04,1. 


Theorem 2.2. We have the factorization 
S(w-*) Ss PP — DOK, p) ws pak.py 
Proof. We have 
ord,-1g S(w™) = ordy o,S(@~*) 
= ord S(w~**) 
= s(kc) 


by Theorem 2.1. On the other hand, the isotropy group of p in the Galois 
group G consists of the powers 


{op} for i=0,....n—1. 


Hence in the ideal p* the prime o¢+p occurs with multiplicity 


3» 
i=0 q-1 


Hence to prove Theorem 2.2 it will suffice to prove: 


Lemma 1. For any integer k we have 


OeGe D> (74 


Proof. We may assume that 1 < k < q — 1 since both sides are (q — 1)- 
periodic in k, and the relation is obvious for k = 0. Since p* = 1 (mod q — 1) 
we find: 


ke ky kip peed Kyap 
Pk=ky-y + kop) +:++++ ka_2p"~1 (modgq — 1) 
pk = kya + ky-ap +++ + ky-ap""* (mod q — 1) 


Hence 
_ tight-hand side of ith equation | 
q- er q-1 
Summing yields 
sk) +p t+---+ p™*) 1 
xe = 74>- —— aa) a 7’ 


thereby proving the lemma. 
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§2. Stickelberger’s Theorem 


In Theorem 2.2 we note that the Gauss sum is not necessarily an element 
of Q(u,,), and the equivalence of ideals is true only in the appropriate ex- 
tension field. Similarly, the Stickelberger element has rational coefficients. 
By the same procedure, we can both obtain an element in Q(u,,) and a corre- 
sponding element in the integral group ring, as follows. 


For any integers a, b € Z and any real number t, we have 
bit) — <bt>EZ and <at> + <bt> — (at bd €Z. 
The proof is obvious. Let us define R = Z[G], and 
I = ideal of R generated by all elements o, — b with b prime to m. 
Then the above remark shows that 
10° R= ZG]. 


Although we won’t need it, we may prove the converse for general insight. 
The matter is analyzed further in Chapter 2, §3. 


Lemma 2. We have 10 = ROO R. 


Proof. Note that m € I because 
m= —(O14m — (1 + m)). 


Suppose that an element of R@ lies in R, that is 


> z(b)o,0 € R 
with z(b) € Z. Then 


> 20) % > eZ foralle 


whence 
> 2(6)b = 0 (mod m), 


and > z(b)d is in 7. But then 
> 2(b)o = > 2(b)\(oy — b) + > 2(b)b 
is in J, thus proving the lemma. 


It will be convenient to formulate the results in terms of the powers of one 
character, depending on the integer m. Thus we let 


eas -(Np-1 
Xn = wy p-1)/m 
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1. Character Sums 


where wy is the Teichmuller character. We define the Stickelberger element 


of level m by 
Am) = > (2 ee% 
ceZ(m)* m 


As a special case of Theorem 2.2, we then obtain the factorization 
FAC 1. S(xp) ~ p®™, 

Therefore, if b is an integer prime to m, and o, = o»,;, then 
FAC 2. Syn)? 7% ~ pAHmMO— 4p), 


In FAC 2 the algebraic number on the left lies in Q(u,,), and the group ring 
element 6(m)(6 — o,) lies in Z[G], namely 


@— opin =F (CE — CE )oz. 


Thus we have the ideal factorization of the (6 — o;)-power of the Gauss 
sum in terms of powers of conjugates of the prime p in Q(u,,). 

We return later to the application of this factorization to the study of the 
ideal classes in the cyclotomic field, but it is worth while here to mention the 
simplest consequence. In every ideal class there exists an ideal prime to m. 
Since the ideal 

ppOmn0b — 2) 


is principal for every prime » { m, we find: 


Theorem 2.3. Let © be the ideal class group of Q({tm). Then for all b prime 
to m, 


(6 — 6,)0(m) 
annihilates ©. 


For each integer r let 


We are now allowing r to have common factors with m. Let: 


M = module generated over Z by all elements 6, with r € Z, called the 
Stickelberger module, 


SF = MC R, called the Stickelberger ideal. 
Observe that -Z is also an R-module. 
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§2. Stickelberger’s Theorem 


Theorem 2.4. The Stickelberger ideal annihilates the ideal class group of 
Quin). 


Proof. Let 
“= > z(r)0,(m) eR 


r 


be an element of the Stickelberger ideal, with z(r) € Z, and the sum taken with 
only a finite number of coefficients 4 0. Then 


> z(r)r = Omod m. 


Tr 


By Theorem 2.2 we have the factorization 


[I Sup? ~ %, 


and it is immediately verified that the left-hand side lies in Q(u,,) by using 
GS 5 of the preceding section. This proves the theorem. 


Next we look at the Jacobi sums. If d is an integer, then d operates in a 


natural way on R/Z by multiplication. We denote this operation by [d]. 
Thus on representatives, we let 


[d]<t> = <dt>, teR. 
It is convenient to let 
Alay, ag] = [a,] + [a2] — [a1 + ag]. 
Recall the Jacobi sum for 7,72 4 1: 


Fa 19) = SGU. 


Let a,, a2 be integers, a, + a, # 0 mod m. Then from FAC 1 we get: 
FAC 3. I (YX p% Xp%2) ~ jp4la,,4216(m) 


where 


Alay, 4,100) = > ea de <8. 7 (a +a acs, 


and A[a;, a.]0(m) € Z[G] lies in the integral group ring. We know that the 
Jacobi sum lies in Q(u,,), so again we have an ideal factorization of an element 


of Q(Hm). 
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1. Character Sums 


It will be convenient to introduce an abbreviation. Let 
a = (a, a2) 
denote a pair of integers. We let 
A[a,, a2]0(m) = O(@mn)[aj, a2] = O(m)[a]. 


In several applications, e.g., in the next section, the level m is fixed, and 
consequently we omit m from the notation, and write simply 


A(m)[a] = O[a]. 
If dis an integer prime to m then trivially 
o,8[a] = 0[da]. 


The next two sections are logically independent and can be read in any order. 
They pursue two different topics begun in §2. 


§3. Relations in the Ideal Classes 


Let G = Gal(Q(un)/Q), so that elements of G can be written in the form a,, 
with c € Z(m)*. We recall the Stickelberger element 


£ -1 
a) = De 
from formulas FAC 1 and FAC 2. Let 


I = ideal of Z[G] generated by all elements b — o,, with integers b prime 
to m. 


Let p be prime number prime to the Euler function ¢(m). For instance, if 
m = p itself, the prime p does not divide p — 1. The character group on G 
takes its values in ¢(m)th roots of unity. We let g = p” be a power of p such 
that p(m) divides q — 1. We let 0, be the ring of p-adic integers in the un- 
ramified extension of Z, of degree n, so that 0,/po, = 0,(p) is the finite field 
with p" = q elements. Then 0, contains the ¢(m)th roots of unity. If m = p 
then we take g = p and 0, = Z,. 

Let @ be the ideal class group of Q(u,,), and @™ its p-primary component. 
We have an isomorphism 


Zp es) GC” w GE), 


The elementary divisors of @® over Z, are the same as the elementary 
divisors of 


0, @€” over vy. 
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§3. Relations in the Ideal Classes 


If A is an 0,-ideal, on which G operates, we let A(y) be the y-eigenspace. 
We let 


I, = 0,-ideal generated by all elements b — x(b) with integers 5 prime to m. 


By abuse of notation, we write often y(b) instead of y(o,). The important 
special case we shall consider is when m = p, in which case it is easy to 
determine /,. We assume p > 3. 


Lemma 1. (i) If x = @ is the Teichmuller character, then I, = (p). 
(ii) If x is non-trivial and not equal to the Teichmuller character, then 
I, = (1). 


Proof. For (i), we can take an integer b of the form 
b=C+ pu 


where w is a p-adic unit, and € = w(b)isa(p — 1)th root of unity. This makes 
(i) clear, and (ii) is obvious, from the definitions. 


In the next sections we shall deal with Bernoulli numbers systematically. 
For the moment, we need only a special case, so we define ad hoc the first 
Bernoulli polynomial 


B(X)=X-4 


and the first Bernoulli number B, = —4, its constant term. For any function 


f on Z(m) we define 
n= Zrom((S)} 


In particular, 


nan Z.(KS)~fp0 


If x is non-trivial, then > y(c) = 0, and hence in this case, 


Pix LG O 


Then in the present terminology, Theorem 2.3 can be reformulated as 
follows. 


Theorem 3.1. For non-trivial y, the ideal B, I, annihilates @(y). 
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Corollary 1. Assume that m = p is prime >3. If y is not equal to the 
Teichmuller character and is non-trivial, then 


ord B,J, = ord B,,;. 
Proof. Immediate from the lemma and the theorem. 
Corollary 2. If x is equal to the Teichmuller character then B, ;I, = (1), 
and €y) = 0. 


Proof. Mod Z,, we have the congruence 


1% ay ie 
Big) = = > c@(c)71 = - > 2s" (mod Z,). 
Pei P¢ = 


Hence B,,; has a pole of order 1 at p. Lemma 1(i) concludes the proof. 


Corollary 3 (Herbrand’s theorem). Assume again thatm = p. Let y = w1~*, 
with2 <k < p — 2. 1f@(y) ¥ 0, then p|B,,, where B,, is the kth Bernoulli 
number. 


Proof. In the next chapter Theorem 2.5, we shall prove the congruence 


Bro ron = B, (mod P) 


1, 
k 
for k in the given range, and any positive integer n. By Corollary 1, we know 
that B,,; annihilates @(y), and 


By, = By gt) = io (mod p). 


If p does not divide B,,, it follows that B,,; is a p-unit, whence @(y) = 0, thus 
proving Herbrand’s theorem. 


The converse of Herbrand’s theorem has been proved by Ribet [Ri]. 
For analogues on the modular curves, see the [KL] series, especially [KL 6]. 

The reader interested in pursuing the ideas of this section may skip the 
rest of this chapter, read the first section of Chapter 2, and then go to 
Chapter 5. 


§4. Jacobi Sums as Hecke Characters 


Let ¢ throughout this section be a fixed primitive mth root of unity. We con- 
sider the additive group 


Z(m) = Z(m) x Z(m), 
of order m?. Its elements will be denoted by 
@ = (Q, a2), b = (by, ba). 


§4. Jacobi Sums as Hecke Characters 


The dot product is the usual one, a-b = a,b, + agb2. For any function f on 
Z(m)® we have its Fourier transform /, and the inversion formulas: 


ye) f@ = > fers 


cr) f0) = 4 flat, 


whose verifications are simple exercises. 
For any prime ideal p in Q(u,,) not dividing m, and a € Z(m) we define 


J(a, p) = I(x», Xp"). 


We extend the definition to fractional ideals of Q(u,) prime to m by multi- 
plicativity, thus defining J(a, a) for all a prime to m. We have: 


J 0. J(0, p) = —(Np — 2). 
We get J(0, a) by multiplicativity. We also need the congruence 
J1. J(0, a)Na = 1 mod m?. 


By multiplicativity it suffices to prove it for prime ideals. In that case it is 
immediate, since m divides Np — 1, and by J 0, 


—(Np — 2)Np = 1 — (1 — Np)?. 

If a,, or ag, OF a, + a2 = Omod™m, then we shall say that a is special. 
Otherwise we say that a is non-special. The absolute value of the Gauss sum 
determined in GS 2 immediately implies a corresponding result for the Jacobi 
sum, namely: 


J 2. J(a, a)J(a, a) = Na_ if ais non-special. 


If a is special, a 4 0, note that J(a, a) = 1 or —1. In all cases, we have 
J 3. I(a, ~) = —> xe W)yp%2(1 — u) = > IB, v)r2 
u b 


where the Fourier coefficient —J(b, p) is the number of solutions u of the 
equations 
Xo(u) = C2 and yp(1 — u) = £2. 
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By multiplicativity, it follows that the Fourier coefficients J(b, a) are integers 
for arbitrary a, that is 


J(b, a) € Z. 


For the rest of this section, it will be convenient to assume that all number 
fields are contained in the complex numbers. 

We have seen that 6[a] is in the integral group ring Z[G]. For any non-zero 
element « € Q(u,,), we let 


w(a, a) = J(a, (a))a~ 4 if a is non-special, 
w(a, «) = J(a, (a)) if a is special, a 4 0 
w(0, a) = 1. 


As usual, («) is the principal (fractional) ideal generated by a. 
If dis an integer prime to m, then trivially from GS 5, 


o4J(a, a) = J(da,a) and o,w(a, a) = w(da, a). 


Theorem 4.1. The algebraic number w(a, «) is a root of unity. 


Proof. As («) ranges over all principal fractional ideals, the numbers 
w(a, «) form a group. It will therefore suffice to prove that these numbers 
have absolute value 1, for then their conjugates also have absolute value 1, 
and these numbers form a finite group. In case a is special the theorem is 
true by definition. Otherwise we can use J 2, so that 


J(a, (a))J(a, (a) = Na. 


On the other hand, the product of « and its conjugate is equal to Na 
under the hypothesis that a, + a, # 0 mod m. Indeed, we have 


la] + [—a] = > (<» ¥ (at. _ (@ tad op 
12d (eB) « Cote) = te to 


If ¢ is a real number and not an integer, then 


and 


oz} 
ceZ(m)* 
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§4. Jacobi Sums as Hecke Characters 


operates multiplicatively like the absolute norm. The desired relation for the 
product of a and its conjugate follows at once. The theorem follows by 
using J 2, the analogous relation for the Jacobi sums. 


The next theorem was proved originally by Eisenstein for prime level, and 
by Weil [We 2] in the general case, which we follow. 


Theorem 4.2. [f a is an algebraic integer in Q(Utm), and « = 1 (mod m?) 
then for all a we have w(a, a) = 1, so for a non-special, 


J(a,(a)) = 041, 


Proof. We fix a and view J, w as functions of a, omitting « from the nota- 
tion. In the Fourier inversion relation, we know that the Fourier coefficients 
J(6) are integers. But « = 1 (mod m?) implies that 


w(a) = J(a) (mod m?). 


This is obvious from the definition if a 4 0, and follows at once from J 1 
if a = 0. Hence w(b) is an algebraic integer for all b. Furthermore, for d 
prime to m, 


o4i(b) = yD, ogw(ayg-# 


= ¥ w(dayt-a 


It follows that w#(b) € Z for all 6. Now by the Plancherel formula, 
P 1 
>, WO)? = 53 D wa)? 


Since we know that |w(a)|? = 1, and (6) is an integer for all b, it follows that 
#(b) # 0 for a single value of b, and is 0 for all other values of b. In particular, 
for this special b, 


w(a) = w(b)0?. 
But w(0) = 1, so #(6) = 1. Putting a = (1, 0) and a = (0, 1) we get: 
w(1, 0) = J(1,0) = 1 and w(1,0) = C% 
w(0, 1) = J(0,1) = 1 and w(0, 1) = C2. 
It follows that 
w(a) = 1 


for all a, thus proving the theorem. 


1. 


Character Sums 


§5. Gauss Sums Over Extension Fields 


We prove in this section a theorem of Davenport-Hasse [D-H]. 


Theorem 5.1. Let F = F, be the finite field with q elements, and let E be a 
finite extension. Let 


Trp and Nop 
be the trace and norm from E to F. Let 


Xe = YX? Nejr and Ag = A ° Tzyr- 
Then 
— Sx(xz, Az) = (— S(x, A) FL 


Proof. Let m = [E: F]. For any polynomial 


F(X) = x" + c,X"-1 a ne Co 


with coefficients in F, define 


WY) = ACer)x(Co). 


Then 


yw: Monic polynomials of degree => 1 over F—> F 


is a homomorphism, i.e., satisfies 


W(fg) = WS)W(g). 


We write n(f) = deg f. From unique factorization we have the formula 


[ie Dwar bs Nia 


where the product is taken over all monic irreducible polynomials over F. 


Suppose f is of degree 1, say f(X) = X + c. Then we see that 


> WP)x = Sy, AX. 


nf=1 


On the other hand, if m > 2 we have 


> Wx" =0. 


nfysn 


Indeed, 


WS) = g"-? > Alex) > xCc0), 


n(fyan 


and the sum over c, in F on the right is 0, as desired. 
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Therefore we find 


1 
(1) 1+ Sy, A)X = Di aver 


Mutatis mutandis, using the variable X™ instead of X, we get 


(2) 1+ Sexe, Ag)X™ = Il imnors 


where the product is taken over all monic irreducible polynomials Q over E, 
and 


Wx(Q) = xa(Co(Q))Ax(er(Q)). 


We shall write the product over Q as 


Peed Eh 


Q POP 
Each irreducible polynomial P splits in FE into a product 
P= (Q,---Q,. 
Let n = n(P) = deg P. Then 
deg Q = n/r. 


If « is any root of P, then [F(«):F] = n and the field F(a) is independent of 
the chosen root. We have the following lattice of fields. 


F(a) 
mir n[r 
F' = EN F(a) 


F 


All the polynomials Q; are conjugate over F, and their coefficients generate 
the field F’ = EM F(a), of degree r over F. We have 


r= (m,n). 
These facts are all obvious from elementary field theory. Since 
Nae = Neyr ° Neyer, Taye = Tryp ° Tyr, 
and 
NrjrCo(Q) = co(P), TrirCi(Q) = c\(P), 
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we get 
W(Q) = (x(Co(P))A(ex(P)) BF 
= yW(P)™". 


With a view towards (2), we conclude that 


(3) I] ( — wQ)X™@) = 1 — p(Pyrxmrryr 
= [To - very 


= I (1 — y(P\EX)"). 


For this last step, we observe that the map 
gig 


gives a surjection of ft, —> Um/r, and the inverse image of any element of 
Lm is a coset of pu, since r = (m,n). This makes the last step obvious. 
Substituting (3) in (2), we now find 


1 
1+ Ss@e4)X"= |] lla 


ém=1 P 


T] Gd + 8@ DEX) 


Ema 


1 + (—1)"*1S(y, Ay" x". 


This proves the theorem. 


§6. Application to the Fermat Curve 


Although we do not return in this book to the applications of Gauss sums to 
algebraic geometry, we cannot resist giving the application of Davenport-— 
Hasse [D-H], Hua—Vandiver [Hu-V], and Weil [We 1], [We 2], [We 3] to 
the computation of the zeta function of a Fermat curve. 

We keep things to their simplest case, the method applies much more 
generally. We consider the Fermat curve V = V(d) defined by 


i a ga oe Sale| 


with d > 2, defined over a finite field F with g elements. Again for simplicity, 
we suppose that d divides gq — 1, and therefore dth roots of unity are con- 
tained in F. 

We let w: F* — y,_, be the Teichmuller character, and 


yx = character such that y(u) = w(u)@- "4, 
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If a is an integer mod d, we let y*(u) have the usual value if u 4 0, and for 
u = 0 we let: 


470) = 1 ifa=0, 
x40) = 0 ifa¥O. 


For u in F, we let: 


N,(u) = number of solutions x € F such that x¢ = u. 


Then 
1 ifu=0 
Nu) = <0 ifu 4 0, wis not dth power in F 
d ifu 4 0, wis dth power in F. 
Therefore 


Nau) = > xu). 


a@modd 


Theorem 6.1. Let N be the number of points of V(d) (in affine space) in the 
field F. Then 


N= 4? — (q—1)> x*% DJG, 2). 


The sum is taken over integers a, b satisfying 0 <a<dand0<b<d, 
anda + b # 0(mod d). 


Proof. We have 


N=> > x(w)x?(v)x"(w) 


a,b,c L(u,v,w)=0 


where the sum over u, v, w is taken over triples of elements of F lying on the 
line 


utv+w=0. 


The sum over a, b, c is taken over elements in Z mod d. 

The term for which a = b = c = 0 yields a contribution of q?, that is the 
number of points on the line in F. 

Next, suppose that in the remaining sum, one of a, b, c is 0 but not all are 
0 in Z/dZ. Say a = 0 but b ¥ 0. Then we may write the sum 


= > xx) a. (0), 


utv+w=0 certain u,w 
and the sum on the far right is 0. This shows that all the terms in the sum 
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with one, but not all, of a, b, c equal to 0 give a contribution 0. Hence we get 


N=P+ DY DY COOH) 
O0<a,b,c<d ut+v+w=0 
where the sum over a, b, c is taken over positive integers satisfying the in- 
dicated inequality. 
If w = 0 then y°(w) = 0. We may therefore assume that in the inner sum, 
we have w # 0. We then put 


u=u'w and v=v'w. 


The inner sum then has the form 


> yrtete(w) 


(u')V?(0’). 

w#d w+v=-1 

Ifa + 6 + c ¥ Omod d, then the sum on the left is 0. Otherwise it is g — 1, 
which we assume from now on. Since 0 < a, b, c < d, there is no such triple 
(a, b, c) with a + b = 0 mod d, because any accompanying c would have to 
equal d. Hence the sum over a, b, c is for a + 6 # 0 mod d, and then c is 
uniquely determined. Changing back the variables u’, v’ to u” = —u', v” = 
—v’ and taking into account the value of the Jacobi sum yields the expression 
as stated in the theorem. 


Let N be the number of points of V(d) in projective space in the field F. 
Then 


N=1+(q— DWN. 


Therefore we obtain: 
Corollary. N=1+q- > a,b 
where Gay = x2*"(—1)J(x*, x”), and (a, b) are as in Theorem 6.1. 


Let N, be the number of points of V(d) in projective space over the field 
F, of degree v over F. The theorem applied to F, instead of F yields an 
analogous expression, the character x being replaced by yx, such that for 
ueF,, 


_ (q¥-1)id (a¥-1Xq-I)Ka-Dd — L+qt--+q@¥~1ly\q-1yd 
Xu) = ou) ou) ou ) . 


This last expression is nothing but x composed with the norm map, in other 
words, it is precisely the character lifted to the extension as in the preceding 
section. The additive character is also lifted in a similar fashion. Therefore 
by Theorem 5.1 we find 
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Note that the power of y(— 1) also behaves in the same way as J when lifted 
to F,. Indeed, if q is odd then 


l+q+--:+q""1 = vmod 2, 


and if g is even, then 1 = —1 in F. 
The zeta function Z(V, T) is defined by the conditions 


Z'/Z(T) =->,N,T’-* and Z(0) = 1. 
It is then immediate that 


IQ = a7) 
Z(V(a), T) = —— eS :- 
MA) = GT = a) 
This is best seen by taking the logarithmic derivative of the last expression 
on the right-hand side. The operator 


ae id 


is a homomorphism, so we take the operator for each linear term. Inverting 
a geometric series we see that the logarithmic derivative of the last expression 
on the right-hand side has precisely the power series 


> Nr: 


Since it has the value 1 at T = 0, it is the unique function having the desired 
properties. 


If finally one starts with the Fermat curve defined over the field of dth 
roots of unity, and one reduces mod primes » not dividing d, one can take 
the product of the zeta functions for the reduced curve over the correspond- 
ing finite field. Then as Weil remarked, since the Jacobi sums are Hecke 
characters, it follows that the Hasse zeta function 


((V(d), s) = I] Z(Vid), Np~°) 


is equal to a Hecke L-series (up to the obvious factors of the zeta function of 


Q(ug) at s and s — 1). 
The computation of solutions in finite fields works in essentially the same 
way for diagonal equations 


@,X491 +--+ + a,x," = 0, 
as in Hua—Vandiver [Hu—V] and Weil [We 1, 2, 3]. The additional connection 


with the Hasse zeta function for the curve over number fields was made by 
Weil. 
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Stickelberger Ideals and 
Bernoulli Distributions 


The study of ideal classes or units in cyclotomic fields, or number fields 
(Iwasawa, Leopoldt), of divisor classes on modular curves (e.g., as in [KL]), 
of higher K-groups (Coates—Sinnott [Co 1], [Co 2], [C-S]) has led to purely 
algebraic theorems concerned with group rings and certain ideals, formed 
with Bernoulli numbers (somewhat generalized, as by Leopoldt). Such ideals 
happen to annihilate these groups, but in many cases it is still conjectural 
that the groups in question are isomorphic to the factor group of the group 
ring by such ideals. 

However, it is possible to study these ideals, the structure of their factor 
group, and the orders of the factor groups in the group ring, without any 
allusion to the applications to ideal classes, divisors, or units. This chapter 
gives the foundations for such study, applicable to many contexts. 

The first section gives Iwasawa’s computation of the index of the Stickel- 
berger ideal for k = 1, directly applicable to the ideal class group in cyclo- 
tomic fields. Next we deal with the basic theory of Bernoulli numbers and 
polynomials, and especially integrality theorems of Mazur and Coates— 
Sinnott. The sections concerning Stickelberger ideals for k > 2 are taken 
from Kubert-Lang [KL 8]. The last sections on distribution relations are 
from [KL 5] and Kubert [Ku]. 

For a discussion of conjectures in the case of totally real number fields, cf. 
Coates [Co 3], [Co 4], and the very general conjectures in Coates~Lichten- 
baum [C-L]. 

The present chapter is organized so that a reader interested especially in 
the structure of the ideal class group in the cyclotomic tower (the basic sub- 
stantial example of the theory) can read the first section, and then can go 
immediately to Chapter 3, followed by Chapter 5 without impairing the 
logical understanding of the material. I followed this pattern when J taught 
the course in 1977. 
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On the other hand, a reader especially eager to get into p-adic L-functions 
can concentrate on this chapter and then read Chapter 4 as a continuation 
omitting Chapter 3. Only the section on the p-adic regulator in Chapter 4 is 
related to Chapter 3. Chapter 2 may then be interpreted as giving the basic 
congruence properties of Bernoulli distributions, and Chapter 4 gives 
essentially more (p-adically) global measure theoretic properties. 

A third alternative is to see Chapters 3 and 4 as forming a pair, describing 
side by side the complex and p-adic class number and regulator formulas 
originally conceived by Leopoldt. 

§1. The Index of the First Stickelberger Ideal 


Let G = Z(m)* be the Galois group of Q(u,,), and assume that m is the 
conductor of that field, so that m > 1, m is odd, or m is divisible by 4. We 
let 


M = t order of G = 4¢6(m). 
We let 
R=Z|G], e =}1-—6.,),  e* =}(1+0_,). 


For any G-module, we let A~ be the (— 1)-eigenspace for o_,. Then multipli- 
cation by e~ is the projection operator on this eigenspace (provided 2 is 
invertible), and e~ is the associated idempotent in the group algebra. 


Lemma 1. We have RT = 2e-R = (1 — o_,)Rand 
(e~R: R-) = 2”, 


Proof. The inclusion (1 — o_,)R < R°™ is clear. Conversely, let P be a set 
of representatives in Z(m)* for Z(m)*/ +1. Let 


a= > 2(e)o51 € Ro 


with coefficients z(c) ¢ Z. Thus o_,% = —a. Then z(—c) = —2z(c). If we let 


B= > x(0)o57, 


ceP 


then a = (1 — o_,)f, thereby proving the lemma, because e~R is a free 
abelian group of rank M. 


We recall the primitive Stickelberger element 


g’ = >» 1, 
Ze <é “ 
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We have written 6’ instead of @ because we are now setting more permanent 
notation, and there is a more canonical element which has priority, namely 


c 1\ _ c 2 
os (<e» ~ 5)e% 1 dB ( Ce »)o: a 
It is immediately verified that 
(*) e-0'=96, andso 0=86-. 


We are interested in ROM R. The next lemma does away with a possible 
alternative definition of this ideal. 


Lemma 2. ROO R= (RVOR). 
Proof. Let T = RO’ CO R. Clearly 
T- <e-RO= RO and JT CR, 


so the inclusion > is obvious. Conversely, let «¢ ROO R. It will suffice to 
prove that « € R@’ (because « € R and « = a7). Write 


a = > 2(b)o,0 = PEO eD = 5)o* 


From the hypothesis that « has integral coefficients, we conclude that 


be 1\ _ 
> 2(b) (7: 2: 5) = 0(mod Z) 
for all c prime to m, so that 
1 _ 
= > 2(b)b = 5 2 2(b) (mod Z). 
We contend that 


> 2b = 0(mod m) and > 2(b) = 0(mod 2). 


This is obvious if m is odd. Suppose m even, so m is divisible by 4. Write 
m = 4m). Each b is odd, and 


> 2(6)b = 0 (mod 2m) 
so > 2(b) is even. Then 
> 2(b)b = > z(b) (mod mZ), 


thus proving also the first congruence. Only the second will be used. 
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Now let s(G) = >a be the sum of the elements of G in the group ring, 
and note that 


e*6’ = 45(G) and (1 + o_1)6’ = s(G). 
Then 
a = > 2(b)oye 6’ = > z2(b)o,(1 — £*)6" 
= > 2(b)0,6" — > z(b)ore* 0" 
= > 2(b)o,6" — > 2(b)4s(G). 


Substituting s(G) = (1 + o_,)0’ on the right and using > z(b) even shows 
that « lies in RO’, and concludes the proof. 


It is of interest to determine the index arising from Lemma 2. This is done 
in the next lemma. We let as usual: 


w = number of roots of unity in Q(ftm). 
Lemma 3. (RO: ROA R) = w. 
Proof. We define a homomorphism 


T: RO6> + 2Z/z 
Ww 


by mapping an element of the group algebra on its first coefficient mod Z. 
In other words, if 


“= > a(c)o,, 


we let Tx = a(1). Note that 


T@)=i- (mod Z), 


us 
m 


and therefore that T is surjective. It now suffices to prove that its kernel is 
ROO RK. But we have 


6,020 = bad (mod R), 
whence for odd b prime to m, and ae R, we get 
T(o,00) = bT(«6) (mod Z). 
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If a is in the kernel of 7, it follows that «9 also lies in R, thereby proving the 
lemma. 


We now assume that m = p" is a prime power. Then 
S = ROAR 
is called the Stickelberger ideal. We want to determine the index 


(R7: Ff). 


Bux 7 a K@)B: (< ») 


for any character y on Z(m)*. Let y’ be the primitive character associated with 
y, and let m’ be its conductor. Then it is easy to verify that if we replace m by 
m’' and y by x’ in the right-hand side, we obtain the same value, so B,,, is 
independent of whether we view x as primitive character, or simply a charac- 
ter on Z(m)*. (The above fact is a special case of the distribution relation, 
discussed in the next section.) 

Next, we shall use the fact that 


Define 


x0) = Biz #0 


for odd characters y. For primitive y the non-vanishing of B,,, comes from 
its relation with the L-series, and will be briefly recalled in Chapter 3. Cf. 
also [L 3], Chapter 14, Corollary of Theorem 2.2. 


Lemma 4. (RO: Rmé) = m™, 


Proof. This is obvious if one can show that R@ is a free abelian group of 
rank M. When m is a prime power, this results from the fact that for odd x 
we have 


x0) = Bz F 0. 


We shall analyze (R~ : /) by the sequence of groups and subgroups 
shown in the following diagram. 


2M ? 


eR -S— R- —2- # 


mT] [| —B,,|U U} ~ 
xodd 


Rn RO 
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We have shown the inclusion relations, and we have also indicated the in- 
dices. All of them have been proved, except the one on the left-hand side. 
This will be the item in the final lemma, and we then find: 


Theorem 1.1 (Iwasawa). Assume that m is a prime power. Then 


(R-:PA)= wT] - FB, 


x odd 


Remark. Even though some inclusions go opposite to each other in the 
diagram, to compute indices one still has multiplicativity, with opposite 
inclusions occurring with opposite exponents. Cf. §4 if you don’t find this 
obvious. 


Lemma 5. (e-R:e7Rmé) = +m™ ial By y 


xodd 


Proof. First observe that the sign is whatever is needed to make the right- 
hand side positive. Multiplication by e~m@ is an endomorphism of QR-, 
which is a semisimple algebra, decomposing into a product of 1-dimensional 
algebras corresponding to the odd characters. Consequently we find 


det(e- m0) = ial x(m0) = m™” Il By 
xodd xodd 


On the other hand, ¢~m@ maps e~ R into itself, and by standard elementary 
linear algebra, the index is given by the absolute value of the determinant. 
This proves the lemma, and the theorem. 


Remark. In Chapter 3 we shall prove that the index computed in Theorem 
1.1 is the order of the (— 1)-eigenspace of the ideal class group in the cyclo- 
tomic field, denoted by h~. The analytic class number formula will show that 
the product of — B,,, yields the positive sign. 


The theorem and its proof are due to Iwasawa [Iw 7]. It was generalized 
to composite levels m by Sinnott [Si]. In the composite case, one cannot deal 
any more with a single element 6, but one has to deal with the module 
generated by Stickelberger elements of all intermediate levels 


Do BCG) =" 


for all divisors d of m. A similar situation had already arisen in the analogous 
situation in dimension one higher, concerning the Stickelberger elements 
formed with B, rather than B,, in the Kubert—Lang series [KL 2], [KL 3], 
[KL 5]. 
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§2. Bernoulli Numbers 


We recall first some general notions concerning distributions, defined by 
Mazur following the work of Iwasawa. 

Let {X,} be a sequence of finite sets, and suppose given a sequence of 
surjective maps 


Rn+1: Xne1 er Xny 
so that we can consider the projective limit 
A oo Xk => XxX, > Xi. 


For convenience, we took our family of sets indexed by the positive 
integers. In applications, it often occurs that the sets are ordered by the 
positive integers ordered by divisibility. For instance, the family of sets 
Z/NZ arises in the sequel. We shall also consider the projective family 


{Z/p"Z}, 


with a fixed prime number p, and = 0, 1, 2,.... In each case, the connecting 
homomorphism 


ty: Z/NZ > Z/MZ 
for M|N is reduction mod M, denoted by ry. 


This type of projective family will also arise in isomorphic form as 
follows. We have an isomorphism 


5 Z/Z > Z/NZ 


given by multiplication with N. We then have a commutative diagram 


! 
wy LIZ > LINZ 


win | 


YL/L—> Z/MZ 


where the left vertical arrow is multiplication with N/M, and the right arrow 
is reduction mod M. Thus the system 


we} 


is also a projective system, ordered by divisibility. 
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Let us now return to the general projective system {X,,}. For each n suppose 
given a function ¢, of X, into an abelian group V. We say that the family 
{~,} is compatible if for each n and x € X, we have 


PAX) = > Pnvrld)- 


Mn-1y =x 


The sum is taken over all the elements of X,,, lying above x. In what follows, 
we often omit the subscripts, and write zy = x, for instance. 

Let K be a ring of operators on V. Let f be a function on X,, for some 
integer m, with values in K. Ifn > m, then we view fas defined on X,, through 
the natural projection on X,,. We conclude at once from the compatibility 
relation that 


> fox) = 2 F(®)@n(X)- 


xEX, 
Let X be the projective limit 
X = iim X,, 
< 


with the limit topology, so that X is a compact space. For each n we have a 
surjective map 


T,: X > X,. 


For each x € X,, the inverse image r, ‘(x) is an open set in X, and the totality 
of such open sets for all n, x is a basis for the topology of ¥X. 

A function f on_X is called locally constant if and only if there exists » such 
that f factors through X,. Such functions are also called step functions, and 
their group is denoted by St(X, K). For each such function, we can define its 
integral 

| P40 = > fero.c, 
xEXy 
independent of the choice of n such that f factors through ¥,. We then call 
the family {g,}, or the functional dg, a distribution on X. It is an additive map 


do: St(X, K) > V. 


Examples of such maps will be given later with Bernoulli numbers. 

Let K be a complete field with respect to a non-Archimedean valuation, 
and suppose that V is a non-Archimedean Banach space over K, i.e., V is a 
complete vector space, with a norm 


| |: V>R?t 
satisfying 
lv + w| < max{|o|, |w]}} v,weV 


lcvoly = |elxloly ceK,veV. 
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If g is bounded, i.e., |~,(x)| is bounded for all n, x € X,, then we say that g 
is bounded, or quasi-integral for the valuation. For any fe St(X, K) we have 


|| #40] < Islet. 


where || f|| is the sup norm of f, and ||| is the sup norm of the values |¢,(x)]. 
Indeed, if f factors through X,, then 


jr 


by the non-Archimedean property, so our assertion is clear. 

In particular, if fe C(X) is a continuous function on X, then we can 
approximate f uniformly by a sequence {f,} of step functions, and since 
If — fall + 0, we get 


>, fe) e.(%) 


xEXy 


< max IF) lon 


fn , Fn|\ >0 


for m, n> co. Hence the integrals 


| tao 


converge, and define the integral 


| tae 


for such a continuous function, provided that g is bounded. This will be the 
case in important examples, and bounded distributions are also called 
measures. 

All this is preliminary to defining the distributions which are of importance 
to us, namely the Bernoulli distributions. If x € Z(N) then x/N can be viewed 
as an element of Q/Z. For any t € R/Z we let <t> be the smallest real number 
>0 in the residue class of t mod Z. What we want is for each positive integer 
k a polynomial P,, with rational coefficients, leading coefficient 1, such that 


the functions 
k-1 x 
xr>N PC N ») 


form a distribution on the projective system {Z/NZ}. Such polynomials will 
be given by the Bernoulli polynomials. Let the Bernoulli numbers B, be 
defined by the power series 


B1. Fé) =344 = > B4- 
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Then for instance 


Observe that 
F(-t) — F(t) =1, 
so that F is almost even, and in particular, we have 
B, =0 if kis odd, k 4 1. 


We define the Bernoulli polynomials B,(X) by the expansion 


B2. F(t, X) = -> B,(X) a 


Then it is clear that the Bernoulli numbers are the constant terms of the 
Bernoulli polynomials, that is 


B, = B,(0). 
We find: 
B,(X) = 1, B,(XY) = X — 5, BY) = X?-— X +4. 


The desired distribution relation is implied by the next formula. 


N-1 
B3. B(X) = N*- > B,(~+ *). 
a=0 


N 


Proof. On one hand, we have 


tei t+ at 4: N-1 Ntel(X + minine 


es Phere 


amr 
ime X + a\ (Nt)* 
= 7 2, >, B N ) k! 
< X+a\] t* 
= 130m) |e 
2 - i N k! 


On the other hand, summing the geometric series > e” directly from a = 0 
to a = N — 1 and using the definition of the Bernoulli polynomials shows 
that the coefficient of t*/k! is precisely B,(X), thereby proving the desired 
identity. 
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Relation B 3 can also be written in the form 


B 4. NewS BC» + x) = B<Ny>) 


t mod N 


for y € R/Z. This can be interpreted as follows. 


(4 


xt> M®-IBAKx>) forxe i Z/Z 


On the projective system 


the association 


defines a distribution. 


Proof. If ye(1/MN)Z/Z is one element such that Ny = x, then all 
elements in the inverse image of x by the mapping (N-id)~1 consist of 


t : 
yt W with t mod N. 


Multiplying B 4 by *~? yields precisely the distribution relation. 

Since the system {(1/M)Z/Z} is isomorphic to the system {Z/MZ}, we can 
also express the distribution relation on the latter. It is convenient to norm- 
alize this distribution further and to give it a special symbol. For x € Z/NZ 


we define 
1 
EQM(x) = NEE B((% >): 


Then the family {EQ} forms a distribution on {Z/NZ}. 


Remark. Historically, this distribution arose in the context of the partial 
zeta functions. Indeed, if x € (Z/NZ)*, define 


Cn(x, 5) = > nos, 


The Dirichlet series converges only for Re(s) > 1, but it is classical and 
elementary that it can be analytically continued to the whole complex plane, 
and Hurwitz has shown that 


ty(x, 1 —k) = —-EM (x) fork = 1. 
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Furthermore the partial zeta functions themselves satisfy the distribution 
relation. For a further discussion, cf. Example 4 at the end of the chapter. For 
distributions associated with zeta functions in connection with Cartan groups, 


see [KL 10]. 
For the applications, we shall use one more formula concerning the 
Bernoulli polynomials, namely 


B5. B,(X) = X* — 4kX*-+ + lower terms. 


This is obvious by the direct multiplication of the series 


t t* t* 
aay = > BG and e%* = De 


For what we have in mind, we don’t care about the lower terms, which have 
rational coefficients. 


Let N be a positive integer, and let f be a function on Z/NZ. We form the 
polynomial 


te + X)t 


Ft, = > /@ 


We define the generalized Bernoulli polynomials (relative to the function f) 
by 


ao tk 
B6. Ft, X) = > BAX) a 
k=0 . 


In particular, the constant term of B, ;(X) is the generalized Bernoulli number 
B,,¢ = By, (0). 

For instance, f may be a Dirichlet character y on Z(N)*, extended to Z/NZ 

by the value 0 on integers not prime to N. Then B,,, is the generalized Ber- 


noulli number of Leopoldt. Directly from the definition, we then find the 
expression 


rene Snn(C$9) 


In terms of the distribution relation, this can be written 


1 
1, -{ dE,. 
Beas as e 
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The distribution {E{} is rational valued. We shall be interested in its 
p-adic integrality properties for a prime p. For this purpose, we describe a 
process which integralizes this distribution. For historical comments, see 
below, after Theorem 2.1. 

Let c be a rational number. For N prime to c (i.e., prime to the numerator 
and denominator of c) we define 


E®Yx) = EQ?(x) — cE (cx), 


for x € Z(N). Multiplication by c or c7? is well defined on Z(N) so our 
expression makes sense. If N is a power of a prime p, then we could also take 
c to be a p-adic unit. We can write symbolically 


Exc = E, = c“E,oc7~ 1. 


This distribution satisfies the following properties. 


ayy -L*\_ do e- 1 
El. Bis) = C5) <> + 5) 


Proof. We have 


si) = B (Cy >) ~ &(CS*>) 
-i- 3 (CGD) 


whence the assertion is clear. 


melee 


(N) = yk-1PFW) 
E2. EL) = x*-LEY2(x) mod kD® 


Z{e, 1/c], 


where D(k) is a least common multiple of the denominators of the coeffi- 
cients of the polynomial B,(X). 


Proof. We work with a representative integer x such that 
O<x<N-1. 
We write 
cx =b+yN 
with an integer b satisfying 0 < b < N — 1 and ye Z[I1/c]. Then 


ctx 


b b 
WN FayaCity>ez 
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with some integer z. Since B,(X) = X* — 4kX*~1 + lower terms, we find 
the following congruences mod N/(D(k))Z[c, 1/c]: 


ne-lB(C 5») _ om (C52 ) 
awl) 3 (a) | 
- were (Ft y-2) F(R +92) | 


k 

2 
ae Bagee x* k-lpy ak k-1 
= — 5% — | - ke CZ ct 5b 


= pee (X _ Co * eek 
= kx' (z KF) + r) 


and Property E 2 follows by using E 1. 
The values of E{) are in 


ee 


ED®) Z[c, 1/c]. 


They will be called N-integral if they are p-integral for every prime dividing 
N. 

Theorem 2.1. (i) The values of E{®) are N-integral. 

(ii) We have the congruence for every prime p dividing N: 


EQ@x) = x¥- LEQ) mod NZp. 


(iii) If c is an integer prime to 2KN and to the denominators of the Bernoulli 
polynomial B,(X), then the values of ES? lie in Z. 
Proof. For large integer v the values N’/k D(k) are N-integral. Let M = N’. 
The distribution relation yields 


EQYx) = > EL) 
y 


where the sum is taken over those y mod M which reduce to x mod N. The 
expression for E{ is obviously N-integral except possibly for the term 
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(c — 1)/2. But if N is even then c is odd, so (c — 1)/2 is N-integral, and if N 
is odd, then (c — 1)/2 is N-integral. If we apply E2 to each term E(y) 
then we see that the first two assertions are proved. 


For case (iii), we take M = (NkD(k))’ for large v. The argument then 
proceeds as before, because the only denominators occurring in 


EMG) or eM CSFD) 


contain only primes dividing Nk D(k). 


For k = | the integralizing process already appears in the Stickelberger 
theorem, and was used extensively by Iwasawa. For k > 1, Coates—Sinnott 
obtained integral elements in group rings by this process [C-S 2], Theorem 
1.3 and [C-S 3], Theorem 1. Mazur formulated this integralizing process 
in terms of measure theory and the distribution relation, which allows the 
jacking up argument used to prove Theorem 2.1. 


For the rest of this section, we let N = p" with some fixed prime number p, 
so the distributions are defined on the projective limit of Z(p"), which is 
none other than the p-adic integers Z,. We view the values of the distributions 
to be in C,, the completion of the algebraic closure of the p-adic numbers. 
We may express Theorem 2.1(ii) in the limit as follows. 


Theorem 2.2. Let c be a p-adic unit. Then 
Ex,,o(x) = x*- TE, (x). 


We shall now express Bernoulli numbers in terms of the integralized 
distributions. 


Theorem 2.3. Let ce Z} and let k be an integer > 1 such that c* # 1. 
Then 


1 2 
B, = Tae if Kees dE, (x). 


Proof. By definition, 


dE, + i c* dE,(c- 1x). 


Zp 


1 
-B -| dE, = 
k k Z k 


Zp 
On the last integral to the right, we make the change of variable 


X E> cx, 
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which gives 


dE, (x) = dE,{c a 1x), 


Zp Zp 
The formula we want drops out by using Theorem 2.2. 


Corollary 1 (Kummer Congruence). Let « be a residue class mod p — | and 
a # 0. Then for even positive integers k = « mod p — 1, the values (1/k)B, 
are all congruent mod p, and are p-integral. 


Proof. Select c to be a primitive root mod p so that 
ck # 1 modp. 


Then | — c* isa unit at p. The values 1 — c* and x*~1 mod p are independent 
of the choice of k in the residue class mod p — 1, and the corollary then 
follows from the expression of (1/k)B, as the integral of the theorem. 


Corollary 2 (Von Staudt Congruence). Let k = 0 mod p — 1, andk even. 
Then 


B, = -5 mod Z,. 
Proof. Suppose p odd for simplicity. Let c =1 + p. An easy induction 
shows that 
c* = 1 + pk mod p*kZ,. 
Hence 
Tom =~ 7g + OW), 
and so 


Be | x*-1dE, (x), 
Pz, 


because the integral over pZ, is = 0 mod p. An approximating sum mod p 
for the integral over Z* is 


DPA ee) 


Since c = 1 + p we have 
> ae 
Dp P 
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The desired congruence follows from the fact that 


p-1 
> x* = —1 mod p. 


x=1 


We leave p = 2 as an exercise. We merely wanted to show how classical 
congruences can be handled systematically from integration theory. 


Let f be any function on Z/NZ. We defined 


now S nom (C$) 


In terms of the distribution notation, this can be written 
1 
k Bus = f dE;,. 
Zp 


We shall apply this when f is a character of finite order on Z*, so that f is 
an ordinary Dirichlet character on Z(p")* for some positive integer n. As 
usual, for such a character, we define its value to be 0 on elements of Z(p") 
which are not prime to p. Then by definition, for any character of finite 
order on Z* we have the formula for the Bernoulli-Leopoldt numbers 


1 
1 Bag = Ly dE, 


Note: When y = 1 we do not have (1/n)B,,, = (1/n)B, because y is 0 on 
pZ, by definition. 


Theorem 2.4. Let w be a character of finite order on Z}. Then 


1 1 : 
Fs Bry = T- Woe [woe dE, (a). 


Proof. We write dE, = dE,,, + c" dE,°c~*, or in other words 
1 
Bay = |W dene + | Wade" dE,(e“*2). 
Integrals are taken over Z*. We let x+>cx in the second integral. Then 


W(c) comes out as a factor. Using Theorem 2.2 concludes the proof. 


Theorem 2.5. Let 2 < k < p — 2. Let w: Z(p)* > Zi be the Teichmuller 
character such that 


(a) = a(mod p). 
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For any integer n = 1 we have 


1 1 
Ss Bao" = k B, (mod Dp). 

Proof. Let wy = w*-". Choose c to be a primitive root mod p, so that 
c* # 1 mod p. By Theorem 2.3 we get 


1 1 : 
i Be= Toe [2 1 dE, (x) (mod p). 


By Theorem 2.4 we have the congruence mod p: 


1 


tes ope geet ool cs 
n Bu i Be ~ ie F —wWcyc? 1 =| dE, (x) 


because 1 — w(c)c* and 1 — c* are p-units. Since the expression in brackets 
under the integral sign is = 0 (mod p), the theorem follows. 


The next sections, §3 through §7, taken from Kubert-Lang [KL 8], deal further 
with the integrality properties of Stickelberger ideals. 


§3. Integral Stickelberger Ideals 


Let k be an integer >2. Let N = p" be a prime power with p > 3 until §6. 
We let: 


G = Z(N)* if k is odd 

G = Z(N)*/+1_ if k is even. 

R= Rg = Z[G] and R, = Z,[G]. 

deg: R —> Z is the augmentation homomorphism, such that 


deg( > m,6 ) => m,. 


ceG 


This augmentation homomorphism extends to the complex group algebra 
by linearity. 


R,, = ideal of R consisting of those elements whose degree is = 0 mod m. 
If J is an ideal of R, we let J, = ITO Rn. 
card G = |G|. 


s(G) = Do. 


oeG 
For any & € R we have 


6s(G) = (deg ¢)s(G). 


If J is an ideal of R, we write d = degJ to mean that d is the smallest 
integer =O which generates the Z-ideal of elements deg € with ¢ in J. 


43 


2. Stickelberger Ideals and Bernoulli Distributions 


Let B,(X) be the kth Bernoulli polynomial. We let 


wy =wrZia( C$) 
co)" Z (a( C$) mao 


k-1 


; N 
= 6, — k B,s(G), 


where B,, = B,(0) is the Ath Bernoulli number. We have: 

deg9#0 and deg6’ #0, fork even. 
In fact, these degrees can be computed easily. We need only that they are 
#0 for k even, but the computation is as follows. Suppose k is odd. We use 
the distribution relation. Summing over all primitive elements, i.e., elements 


of p” yields the value of the distribution summed over all elements of level 
p”"~*. Continuing in this fashion reduces the computation to level 1. But 


1 a 1 1 
ma S op ¢ 2S) =. BO) =; Be 
oO ae ACS) RBWO) = 5 Be 


The degree of @ arises from the same sum but with the term a = 0 omitted. 
Hence 


— pk-1 
deg 0 = 1- pt" B, 


k 
and 
k-1 
deg 0’ = deg 6 — x B,,|G| 
or 
ee 1 — peo? Neo} 4 
deg o' = (=F _ 1 gop) a. 


These formulas would also be valid for k even, except for our convention to 
take G = Z(N)*/+1. This requires dividing the formulas by 2 to get deg 0 
and similarly for 6’. The non-vanishing for k even comes from the functional 
equation of the zeta function. 

Next we give the ideals used in integralizing the distribution. 
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J(N) = ideal of elements > m(b)o, such that 
> m(b)b" = 0 (mod N) 


I®(N) = ideal of elements o, — c* with integers c prime to N. 


Since k and N remain fixed, we often write 9 and 6’ instead of 0,(N) and 
0,(N). Similarly, we write J and J, or J and J. It is obvious that 


J® > J), 


We shall determine the extent to which J 4 Jin Lemma 4. 
We have: 


deg I(N) = p', where ¢ is the maximum integer such that k = 0 mod $(p’). 


This is obvious, because deg J“(N) is generated by the integers | — c* with 
c prime to p. 


Theorem 3.1. (i) We have 
RO, AR = I6;,. 


In fact, if an element & € R is such that 0’ € R, then Ec I™. 
(ii) On the other hand, letting If = Z,I, we have 


R,O;, fa) R, — I6,,. 
If an element € € R, is such that 0 € R, then € «I. 
Proof. First we prove that for any prime >2, we have 
I0’< R, and 1,0 ¢ R,. 


A similar property is due to Mazur and Coates—Sinnott, as mentioned before. 
Indeed, we have 


asa, — c¥)0, = > ESXaoz? 


aeG 


where 


ERG) = NAAT [>(<% >) . omms( "> )| 


The p-integrality then follows from Theorem 2.1(i). For other primes we 
need a lemma. 
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Lemma 1. The polynomial (1/k)(B,(X) — B,(0)) maps Z into Z and maps 
Z, into Z, for every prime 1. 


Proof. A standard property of Bernoulli polynomials states that 
7 (BAX + 1) — BX) = x#-1, 


Hence for any integer m we see recursively that the first assertion of the 
lemma is true. The second, concerning /-adic integers, follows by continuity. 
The lemma is also valid for p = 2. 


We may define E,, by using B,(X) — B,(0) instead of B,(X) in the 
definition of £,,. The lemma shows that J6’ < R. 
For convenience we let 
BX) = BX) — B,(0). 
Lemma 2. (i) Let € € R and suppose that €6' € Z,[G] = Ry. Then Ee J. 


Gi) Let € € R, and suppose that €0 € Ry. Then €€J, = Z,J. 
Proof. Write € = > z(b)o, with integral coefficients z(b). Then 


f= ND F207 Bi % Joe, 


and therefore 


k-1 
se > 2B, (<F>) igpeintegtal: 
b 


But an elementary formula for Bernoulli polynomials, obtained directly 
from the definition, gives for an integer b, 


N*E-1 b EO NE-1 (k b\k-i 
k Bi) = 2, ke (;) (x) 


Comparing the leading term modulo all the lower order terms, and taking 
into account that B, = —4 is p-integral (here we use p # 2), and the Kummer 
theorem that B, is p-integral for i < p — 1, we find 


k 
= = 0 mod } Zp. 


Multiplying both sides by KN proves the lemma. 
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Lemma 3. Let p* be the smallest power of p such that p‘0;, is p-integral. 
Then 


s=n-+ ord,k. 


We have I OZ = (p'). 


Proof. The argument uses the same expression for the Bernoulli polynomial 
as in the previous lemma. We see that 


k-1 (k k-i 
p’ x (1) 2(5) is p-integral. 


The leading term is p’/kN. The Bernoulli numbers B, are p-integral for 
i <p -—1 by Kummer, and for i > p — 1 the power N*~? in front in- 
tegralizes (1/N)*~'. It follows that 


p 
kN is p-integral, 


whence s has the stated value. Since we have already seen that J0’ < R, it 
follows that the p-contribution of J. Z is exactly p’. It is clear that IN Z 
is equal to (p*), because we can always select 


c=1modN and c=O0Omod/ 


for any prime / # p to see that IM Z contains elements prime to /. This 
proves the lemma. 


Lemma 4. We have J = I + ZN, and (J: 1) = p*-™ = p™*, 


Proof. It is clear that N e J. Conversely, write an element of J in the form 


> m(c)(o, — c*) + > m(c)c*. 


The first term is in J, and the second term is an integral multiple of N. This 
proves the lemma. 


We may now conclude the proof of the theorem. We prove (i). Suppose 
€e Rand €0’e R. By Lemma 2, é ce J. By Lemma 4, we know that 


€ =zNmodI for some ze Z. 
We know that J0’< R. Hence zN@’« R. By Lemma 3, it follows that p* 


divides zN, so € € J, and the theorem (i) is proved. The part (ii) is proved the 
same way. 
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§4. General Comments on Indices 


Let V be a finite dimensional vector space over the rationals, and let A, B 
be lattices in V, that is free Z-modules of the same rank as the dimension 
of V. Let C be a lattice containing both of them. We define the index 


(C: B) 


(AaB = oa 


It is an easy exercise to prove that this index is independent of the choice of 
C, and satisfies the usual multiplicativity property 


(A: D)(D: B) = (A: B). 
Furthermore, if E is a lattice contained in both A and B then 


(A: EB) 


(A: B) = re 


We leave the proofs to the reader. 

Suppose that A is not only a lattice, but is an algebra over Z. Let 6 be an 
element of QA = V and let m be a positive integer such that m0 € A. Assume 
that 0 is invertible in QA. Then 


(A: AO) = +detg, 8, 


where the determinant is taken for the linear transformation of QA equal to 
multiplication by 9. This is easily seen, because 


(A : AO) = (A: Am6)(Ameé : AO) 
and 
(Am6 : A) = (AO: Amé)~?. 
Since m6 lies in A, the index (A : Am6) is given by the absolute value of the 
determinant of m6, which is m’ det 0, where r is the rank of A. This power 
m' then cancels the other index. 
Note that the determinant can be computed in the extension of scalars by 


the complex numbers. In particular, if A is a semisimple algebra, and is 
commutative, then 


deto =| [ x(6) 
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where y ranges over all the characters of the algebra, counted with their 
multiplicities. In the applications, the algebra is essentially a group ring, so 
the multiplicities are 1, and the characters come from characters of the group. 

This will be applied to the case when 6 = 0. We recall the definition of 
generalized Bernoulli numbers according to Leopoldt: 


names Zam (C$) 


Thus 
1 
x0) = k By. 


Note that the Bernoulli number is defined with respect to G, so that for k even, 
we are summing over Z(N)*/+1. This convention is the most useful for 
present applications in §5 and §6. (We revert to the other convention in §7.) 
For even k, it gives half the other values. 

The classical theorem about the non-vanishing of B,,, when k and y have 
the same parity gives the desired invertibility of the Stickelberger element 6,, 
in the corresponding part of the group algebra over Q. 


§5. The Index for k Even 


We let s = n + ord, k, and t is defined as in §3, to be the maximum integer 
such that kK = 0 mod @(p’). We regard Ry A RO (for k even) as the Stickel- 
berger ideal. We shall prove: 


Theorem 5.1. 


1 
(Ro: Ro 0 RO) = Np*4*-t] [ + 5 Boe 


xF1 


First observe that since deg 0 and deg 6’ # 0 we have 
Ro O RO = Ry XO RO. 
By Theorem 2.1, we conclude that 
RO’ AR =I’, andhence ROO Ro = 16’. 
But Ry + 16’ = R, where 
d = deg I0’ = (deg J)(deg 6’). 
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In §3 we had noted deg J = p‘. The factor deg 6’ will cancel ultimately. In 
any case, we have: 


(Ry: Ro 0 RO) = (Ry: Ro 0 RO’) 
= (Ro 5 1,0’) 
= (R,: 10’) 


_ (RR: 16’) 
~ (R > Ra) 


a TR : R6’)(RO’ : 16’) 


= TTT x0'xR: D. 


The product is taken over all characters y of G. We separate this product 
into a factor with the trivial character, giving deg 6’, canceling that same 
factor in d, and the product over the non-trivial characters. For x non-trivial, 
we have (8) = x(6’). 

In the final step we also wrote (RO’ : J0’) = (R: 1). This is because 6’ is 
invertible in the group algebra over Q. Hence the map €++ €6’ induces an 
isomorphism on R. 

We are therefore reduced to proving a final lemma. 


Lemma. (R: J) = p® where s = n + ord, k. 


Proof. We have (R: 1) = (R: J)\(J: 1). Any element € in R can be written 
in the form 


Cs > m(c)e. = > m(c)(o, — c*) + > m(c)c*. 


From this it is clear that (R: J) = N, and the index (J: J) is obvious, thus 
concluding the proof. 


Remark. Of course we have not determined the sign occurring in the 
product of the Bernoulli numbers. It is the sign which makes the product 
come out positive, and which one determines easily from the functional 
equation of the zeta function and the factorization in L-series. This is irrel- 
evant for our purposes here. 


§6. The Index for k Odd 
Assume k is odd. Note that 9 = 6’. Let 
e~ = (1 — o_,) 


be the idempotent which projects on the (—1)-eigenspace. It is immediate 
from the definition that @ is odd, that is, 


6-0 = 6. 
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The Stickelberger ideal in this case is ROM R = JO, and is odd. We shall 
prove: 


Theorem 6.1. 
(R~ : ROQ R) = Np i + x Bye, y- 


yoda 
The rest of the section is devoted to the proof. 
Lemma 1. We have R~ = 2e~ Rand (e~ R: R~) = 292, 
Proof. This is the same as Lemma | of §1. 
We then proceed as in the even case. First we write 


(e—R: e710) 


(R- : 10) = (ERR) 


and then 
(eR: e-J0) = (eR: 7 RO)\(e- RO: e~ IO) 
= [| xOe-R:e-D) 


yoad 
because 9 is invertible in e~ Q[G]. Furthermore, 
(e-R:e-1) = (e-R: R-\R™ : 2e-DQe-T: 2-1) 
= (R7 : 2e7J) 

because (2e-I': e~I) = 2-9/2 since e~J is free of rank ¢(N)/2. Finally, 

Lemma 2. (R~ : 2e71) = p* where s = n + ord, k. 

Proof. The group 2e~J is generated by elements of the form 

(¢, — G_,) — c*(o, — o_). 

An element € € R™ lies in Z(a, — o_,) mod I. Hence the same argument as 


in the past case gives the desired index. 


§7. Twistings and Stickelberger Ideals 


The Stickelberger elements 6, should really be indexed by the groups to 
which they correspond. We now want to compare factor groups of the group 
ring by various Stickelberger ideals, twisted in various ways. Consequently, 
it is not useful any more to have G different in the even or odd case. For this 
section, we let NV = p” still, and we allow p = 2. We let 


G, = Z(p")*. 
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We define 
9x.,(P") = Ge (Ge — c*)O,(p") € Z(p")[G,].- 


This makes sense since we know from §1 that 6, ,(p”) is p-integral. 
Let V be a Z(p")[G,]-module. We define its twist to be the tensor product 
with the roots of unity, 


V1) = V® py. 
Then o in G operates diagonally, 
o(v@ y) = ov@oy, and a,(v@y) = alo @ »). 


We let y be a basis for zy over Z(N). Note that the element a on the right 
makes sense as an element of Z(N) since V @ pty is a module over Z(N). 
From the definitions we then get the formula 


TW 1. O,,.(0 ® y= 9-100 @ Ys 


resulting from Theorem 2. 1 (iii), 
E,,(a) = a*~1E,,(a) mod N. 


The distribution relation allows us in E 2 to replace N by high powers of NV 
at a higher level, and then return to level N to get this congruence. 

In particular, if 6,,_;,. annihilates V, then 6, annihilates V(1). The argu- 
ment simply extracts in a general context the argument given by Coates— 
Sinnott [C-S 2] in connection with the ideal class groups in cyclotomic fields, 
see their Theorem 2.1. 

Take V to be Z(p")[G,] itself, so that V(1) is generated by a single 
element o, ® y. The map 


or> Cai @ y) 


gives an isomorphism 
Z(p")Gn] > Z(p" Gn] @ Mer. 


Let %.(p") = ideal of Z(p")[G,] generated by the elements 6,,,.(p"). Then 
the isomorphism induces a bijection 


Sd p") > F,-1(p") @ Mp. 


Hence we get an isomorphism 


TW 2. An| Fp") => An @ bp"! Fe—1(P") ® bp, 


where A, = Z(p")[G,] is the group ring. 
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We may then pass to the projective limit. The limit of A, is the Iwasawa 
algebra. We let -%, be the ideal generated by the elements 6,,,, (projective limit 
of 0,,,-(p”)). We obtain an isomorphism with the twist, 


A] SF, > A()/Fe-1()- 


This isomorphism permutes the eigenspaces for the action of u,_,, and this 
can be interpreted in terms of congruence relations between Bernoulli— 
Leopoldt numbers (with characters) in the obvious manner. 

We now make remarks concerning twistings, ideal classes, and modular 
curves. We assume that the reader is acquainted with the latter. Suppose 
N = p is prime # 2, 3. The Iwasawa—Leopoldt conjecture predicts an iso- 
morphism 

C~ & (RA), 


where C ~ is the p-primary part of the (— 1)-eigenspace of the ideal class group 
in Q(u,). On the other hand, Kubert-Lang [KL 7] establish an isomorphism 


O(Xi(p)) & Rol A, 


where @°(X,(p)) is the cuspidal divisor class group on the modular curve 
X,(p), generated by the cusps lying above the rational cusp on Xp (p). 
Consequently, we expect a commutative diagram: 


R* (pF p) => R-(p)® byl Alp) @ Up 


€(X,(p) Mp) —S— Co(p) ®@ be 


It remains a problem to give a direct isomorphism at the bottom, from some 
sort of geometric construction. This may in fact lead to a proof of the 
Iwasawa—Leopoldt conjecture. 


§8. Stickelberger Elements as Distributions 


In this section we follow Kubert~Lang [KL 5] to describe a “‘Stickelberger 
distribution” associated with a distribution on Q/Z, and to give its basic 
properties. 

Let A be a function on Q/Z (with values in some abelian group, but for 
the rest of this section, we shall take values in some algebraically closed field 
F of characteristic 0). We say that / is an ordinary distribution if it satisfies the 
relation 

h(r) = > A(t) 
Dt=r 
for every element r ¢ Q/Z, and positive integer D. The sum is taken over 
those elements ¢ such that Dt = r. In the application we have in mind, A 
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will be obtained from the first Bernoulli polynomial, and generalizations on 
(Q/Z) lead to the higher Bernoulli polynomials. See [KL 5] for k > 1. 


We let G(V) ~ Z(N)*, writing the isomorphism as a+> oy. We let A be an 
ordinary distribution as above. We define 


hy(x) = (CH >) for x € Z(N). 


For any function f on G(N) we define (as usual) 
Swf, hy) = > f@hy(a), 


with the sum taken over ae Z(N)*. If we define f on Z(N) to be 0 outside 
G(N) then we see that 


Su(f hn) = | f dh 


By abuse of notation, we often write a € G(N) instead of ae Z(N)*. 
Let Zy = (1/N)Z/Z and let re Zy. We define 


evr) = TANT Ps A(ra)oz?. 


If the values of # are in the field F, then the values of gy are in the group 
algebra F[G(N)]. It is clear that if M is a denominator for r, i.e., 7€ Zy and 
M divides N, then the image of gy(r) under the canonical homomorphism 
G(N) — G(M) is equal to gy(r). Thus we may define 


g(r) = lim gy(r) 


in the injective limit of the group algebras (as vector spaces over F), ordered. 
by divisibility, with the injections from one level to a higher one given by 
sending one group element to the sum of all the group elements lying above 
it under the canonical homomorphism. 


Theorem 8.1. The function g: Q/Z— lim F[G(N)] is an ordinary distri- 
bution. 


Proof. Immediate from the definitions. 
We define g to be the Stickelberger distribution associated with h. 


Let Ay be the vector space generated by the values g(r) with re Zy 
(essentially the same as the vector space generated by the values g,(r)). We 
observe that g(0) is a constant multiple of the augmentation element, that is 


_ AO) 
8) = TET Din” 
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Let y be a character of G(N) and let m = m(x) be its conductor. We define 
SCG A) = Sums Mm) 
where y,, is the character on G(m) determined by x. We let 
G,(N) = set of characters y such that S(z, h) # 0. 


Theorem 8.2. The dimension of Ay is equal to the cardinality of G,(N). 


Proof. The space generated by the elements gy(r) with r € Zy is clearly a 
G(N)-module since 


Ongn(r) = gy(rb), for be GIN). 


We let the idempotent associated with y be the usual 


If M is the conductor of y, then 


tu( a7 )ee = SE raaay ex 


as one sees at once from the fact that ra depends only on the residue class of 
amod M, for ae G(N). Hence Ay has a non-trivial y-component if S(x, A) 
# 0. This shows that the dimension of Ay it at least that which we asserted. 

On the other hand, let r€ Z, and suppose r has exact period M. Let x be 
any character of G(NV). Then 


an(r ey = aT - > h(ra)x(a)e, 


eG(N) 


FEIT se MO) de He 


eEG(M redya=b 
If the conductor of x does not divide M, then x is non-trivial on the kernel 
of the reduction map 
redy: G(N) > G(M), 
and the sum on the right is 0. If the conductor of y divides M, then x(a) = 
7(b) on the right, so 
1 |G) 
= h(rb b 

BHC )s = TENT scbin MO [acany| HO 


G(M) 


= TK Wi . a A(rb)x(b)e,. 
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Since we can write r = a/M with some a prime to M, a change of variables 
in the sums shows that up to a non-zero constant factor, gy(r)e, is equal to 


Su(Xus hye, 


We now have to analyze this sum. The next lemma will show that this sum 
is equal to some factor times S,,(%m> Am): 


Lemma. Let y be a character of G(N) with conductor m. 


(i) If every prime dividing N also divides m then 


Swans hy) = Sm(Xms hy). 


(ii) Let p be a prime dividing N but not dividing m. Write N = p"M with 
Pt ™M. Then 


Sy(Xn» Aw) = (L — Xm(P))Su(xus Am). 


Proof. The first statement is immediate from the distribution relation. Let 
us prove (ii). We have 


>, Kaila = > 10) > ns): 


aeZ(N)* eZ(M)* a=b(M) 
aeZ(N)* 


By the distribution relation, we know that 


b x a a 
(Gz) ~ 2") ~ dp Ue) * 2, HH) 
M poe N oo N >. N 
x=0(M) a=0b(M) a=0(M) 
The elements a in Z(V) which are not primitive but are = b mod M are in 


bijection with the elements c € Z(N/p) satisfying the conditions 


a=pc and c=p~tamod M, 
under the map 
C+ pe 


which sends Z(N/p) into pZ/NZ < Z/NZ. Therefore the sum over primitive 
elements lying above a given b can be expressed as a difference 


5,8) -8) - Ea) 


(where the sum is taken over c € Z(N/p), c = p-tamod M) 


=o) Ae 
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(by the distribution relation). Plugging this into the first relation, and making 
a change of variables b +> pb, we find 


Sul, hy) = Sua Au) — 2. non( 2) 


beZ(M)* 
= (1 — ym(p))Su(ym, hm). 


This concludes the proof of the lemma. 


In applying the lemma to the theorem, we note that the y-component is 
at most one-dimensional, and has exactly dimension 1 under the stated 
condition S(x¥, h) 4 0. This concludes the proof of the theorem. 


A distribution can be decomposed as a direct sum of an odd and an even 
distribution, provided that its image is contained in some module on which 
multiplication by 2 is invertible. 

In the next section, we shall prove that the rank of the values on Zy is 
at most |Z%|, where Z;; is the set of primitive elements in Zy. 

If we take for A the distribution arising from the Bernoulli polynomial 


A(r) = B,(<r>) ifr 4 0, AO) = 0 


then the non-vanishing of B,,, for odd characters x shows that A has the 
maximal attainable rank for an odd distribution. Consequently, we find: 


Thoerem 8.3. The Stickelberger distribution g associated with h(r) as 
above is the universal odd ordinary distribution into modules on which 
multiplication by 2 is invertible. 


So far, Theorem 8.3 has been proved only for distributions with values in 
a field of characteristic zero. However, the next section will give a result of 
Kubert showing that the universal distribution is generated on Zy by free 
generators whose cardinality is |Z}|. This will take care of the additional 
integrality possibilities allowed in the statement of Theorem 8.3. 

Later in the book, we shall see that the cyclotomic units in the cyclotomic 
field form an even distribution, which has maximal rank by the class number- 
regulator formula, cf. Chapter 3, §3, and Chapter 6, §3. 

The direct sum then yields a distribution of maximal attainable rank. 
This is one method to show that the universal distribution in Theorem 9.1 (ii) 
has rank |Z}|. 


§9. Universal Distributions 


In this section we give a theorem of Kubert [Ku 1], [Ku 2], constructing a free 
basis for the universal distribution on (1/N)Z/Z. In [Ku 2] Kubert gives a 
complete treatment of the ordinary universal distribution on Q*/Z* for arbi- 
trary k, as a GL,(Az)-module, where A, is the ring of integral finite adeles. 
Here we limit ourselves to k = 1, and give only the abelian group structure. 
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For simplicity of notation we let 
1 
Zy= y LIZ and ey = 


We let 
g: Q/Z — some abelian group 


be an ordinary distribution, in other words we suppose that for re Q/Z, 
and a positive integer D we have 


> g(t) = g(r). 


It is clear that such distributions form a category, and we wish to construct 
the universal distribution. 

We let Zx be the set of primitive elements in Zy, i.e., elements having 
period exactly N in Zy. 


The prime power case 


Let N = p” be a prime power and write N = MD, a factorization with 
M > 1. Let re Zj;. If Dt = r then it is immediate that te Z* (N = prime 
power is used here). The distribution relation shows that g(r) is an integral 
linear combination of the images of the primitive elements g(t). Hence 0 and 
these primitive elements generate the universal distribution, at level N. 

We have 


2 g(t)=8(0) and > g(t) = 2(0). 


tEZnip 


Hence we get one relation among primitive elements, 


>, a(t) = 0. 
teZNn 
Let 
Ty = Zi — {ey} and T* = {0}. 


Ty = Ty U TH. 


Theorem 9.1. (i) The elements g(Ty) generate the abelian group generated 
by g(Zy). 

(ii) If g is the universal distribution then the elements g(t) with t€ Ty are 
free generators. 

(iii) The cardinality of Ty is equal to that of Z%. 


Proof. The first statement is obvious from the preceding remarks. The 
cardinality of Ty is clearly equal to that of Z%*. For (ii), we may consider the 
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free abelian group generated by the elements of Z¥ and {0}, modulo the 
single linear relation 


> =o. 


teZN 


We can then define g on Z# to be the canonical homomorphism in the factor 
group, and for re Z% with M # N and M|N we can define 


g(r) = >. g(t), with D = N/M. 


It is then clear that g defines a mapping on Zy satisfying the distribution 
relation. 


The proof of (ii) is in some sense natural, but in many ways it is better 
to exhibit mappings which are distributions and which have the appropriate 
rank to get the lower bound for the rank of the universal distribution. Cf. 
the end of §8, where we exhibit natural distributions in the theory of cyclo- 
tomic fields which have such rank. 


The composite case 


To state the theorem concerning the universal distribution in the com- 
posite case, we shall write elements of Zy according to their partial fraction 
decomposition. Let 


n= []|p@. 


iz1 


Then 
172-=@-427/2 
NO oS pe 


and 


where aq; is well defined mod p?, while a is well defined mod N. We let: 


Ty = set of elements a/N as above, such that either a, is prime to p; and 
a, ~ 1, or a, = 0. 


It is then clear that Ty has cardinality (4). 


Theorem 9.2. The preceding theorem holds with this definition of Ty, for com- 
posite N. 
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Proof. The proof will be a simplification of Kubert’s proof by Katz. Let 
Ay be the abelian group generated by g(Zy). A distribution having the lower 
bound ¢(N) for its rank has been exhibited in §8. Since Ty has this cardinality, 
it will suffice to prove that g(Ty) generates Ay. We first show that the elements 
g(a/N) with a such that a; is prime to p, or a, = 0, generate Ay. We do this by 
induction. 

Let 

be 
Dis 


be an arbitrary element of Zy. Write b; = pia, where a, is 0 or prime to p. 
If a, = O then we are through by induction, so we can assume that a, is prime 
to p, and 1< r < n,. Then: 


Pid *) ( ( ay Cj )) 
dt Dk aye wt ise Pit 


ay J CG 
j 2, ae Po. -y «) 


by the distribution relation. Since r < n, it follows that 


oi 2 Hh, 
Pit pi ph 


where aj is prime to p. 

Inductively, we may now repeat the same argument with respect to po, 
Ps,...- It merely suffices to observe the following. In the first step of the 
argument, when we factored out pj, thus changing 5; to c,, if b; is prime to p 
then c; is prime to p. Thus performing the same argument inductively on the 
other primes does not destroy the desired property for those primes which 
have already been taken care of. This concludes the first part of the proof. 

Secondly, we show that we can recover those elements a/N for which a; 
may be equal to 1 from the prescribed set Ty. Let 


N’=Niph, yet Z/Z. 


From the distribution relation, we find: 


> “(4 + ») = g(phy) 


n 
1 
j mod pf Pi 


k 
eam + ») = g(pi~y). 


k mod p31~+ 
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Subtracting yields 
J = 
> (+ + y) = 0Omod Ay, 
jmod Ph. Pr i 
G,p)=1 


where Ay. is the group generated by g(Zy-). This yields 


nt?) Ge +) 

—gl—= + = — + y) mod Ay. 

(= y 2, glo +? x 
(a4,p1)=1 

Observe that the same quantity y occurs on both sides of this relation. We 

may now repeat the procedure inductively on the partial fraction decomposi- 

tion of y. If we write 


and say a, = |, we get a similar congruence 


ay a, Ae 
-g(S+y,)= (+24 ) mod Aye, 
s( oa ys] Zz alo + om t 2 y 
(ag,Pq)=1 


where N” = N/p%z. In this way we reduce the proof to the case when N con- 
tains fewer prime factors, and then can apply induction with respect to the 
number of prime factors to conclude the proof. 


§10. The Davenport—Hasse Distribution 


In this section we give a relation of Davenport-Hasse [D-H]. Let F, be the 
field with g = p” elements, and let g = 1 mod m. We follow the notation of 
Chapter 1, §1. We let p be a prime in Q(u,_;) lying above p, and let PB be a 
prime in Q(u4-1, Hp) lying above p. We write as usual 


a = B mod* B 


to mean that «8-1 = 1 mod mg, where mg is the maximal ideal in the local 
ring at ®. We use similar notation mod* p or mod* p. We let x, yw be characters 
on F*, and put 


(x) = —S(y, A). 
Theorem 10.1. (Davenport—Hasse) We have 
T [eGW) = tC, m) 


mad 


where CW, m) = yWm-") TT rp. 


xm=a1 


61 


2. Stickelberger Ideals and Bernoulli Distributions 


Proof. Let u,,(y) be the quotient of the left-hand side by the right-hand side, 
that is 


Thaw) 
t(W")C(, m) 


We have to show u,,(y) = 1. First note that u,,(y) lies in Q(u,-1). This is 
immediate by looking at the action of o,,,, cf. GS5 of Chapter 1, §1. From 
the fact that |S(y)| = /q ify ¢ 1 and |S(W)| = 1 ify = 1, we conclude that 
|un(w)| = 1. Similarly, all conjugates of u,,(y/) have absolute value 1. Since 
S(W)S(v) = +4, we know that only primes dividing p occur in the factoriza- 
tion of S(w). We shall prove that 


Unt) = 


(1) Un(W) = 1 mod B. 


This will imply that u,,(y) is a unit, and therefore a root of unity. If p 4 2, 
this congruence (1) implies that u,(w) = 1. If p = 2, we shall give the argu- 
ment at the end of the proof. 

To prove the congruence, we simplify the expression in Stickelberger’s 
theorem. For any integer k we had defined s(k) = s,(k) and »(k) = y,(k) in 
Chapter 1, §2. We let r(k) = r,(k) be the unique integer such that 


O<r(k)<q-—1 and k=r(k)modg-— 1. 
Lemma 1. Let 0 < k < q — 1. Then 


k—s(k) 
k! = (—p) ?-! yk) mod* p. 
Proof. By induction. Suppose first that p{ k. Then ky > 1, and 


s(k) = s(k — 1) +1, Wk) = vk — ko. 


The assertion is then obvious from the inductive step for k — 1. Next sup- 
pose p|k, so k = pk’. Since 


org = fe) +--+ [Ea 
Pp Pp 


and similarly for k’, we see that 


ord, k! — ord, k'! = k’. 
In k! = (k’p)!, the factors not divisible by p give a contribution of 
(p — 1)!=-1 mod p, 


taken k’ times. The product of the factors divisible by p yields k’! p', where 
t = ord, k’!. The lemma is then immediate. 
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As in Chapter 1, we let ¢ = e?”/?. We let z = ¢ — 1. Then from 
(e-1+1)?-1=0 


we see at’once that 
n?-1 = —p mod* z. 


From Stickelberger’s theorem and Lemma 1, we conclude that 


(2) (wo) = a 


This reduces the proof of the congruence relation (1) to the proof of such a 
congruence for the expressions on the right-hand side of (2), corresponding 
to the way u,,(y) is made up from expression t(w~“) for appropriate values 
of k. We shall prove two relations for the residue function, namely: 


(3) > r(x t+ y) = r(my) + 27) 


mMmx= x= 


(4) [ | r@ +»)! = r(my)! moo [I r(x)! 
mx =0 mx=0 

In these relations, sums and products are taken over elements x mod q — 1 

such that mx = 0 mod q — 1. The theorem is immediate from these relations, 

taking into account 


mr = o(m)™ mod B, 


applied to y such that y= w7’. 

We prove the two relations (3) and (4). To begin with, we note that the 
left-hand side and right-hand side of each relation is unchanged when we 
change y in a residue class mod (q — 1)/m. Consequently we may assume that 


gc): 
O<y< oa 


We choose the obvious representatives 


withy = 0,1,...,m— 1. 
Then 
r(x) = x, raxty)=H=xt+y, r(my) = my. 


This makes (3) obvious, and (4) takes the form: 


(5) ee eee a re (» 1—*): mod* p. 
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The products are taken for vy = 0,1,...,m — 1 and y is taken as above, with 
0<y<(q-— 1)/m. Let F(y) be the left-hand side of (5). Then the right- 
hand side of (5) is equal to F(0), and consequently, it suffices to prove that 


F(y) 
F(y - 1) 


with 1 < y < (g — 1)/m, or equivalently 


[1 (y+ 4) 


‘| (my = v) = 1 mod* Ps 


= | mod* p, 


m™ 


or also 


7 my + v(q — 1) 


= * 
Wh = | mod* p. 


v=0 


For this it will suffice to prove that each factor in the product is = 1 mod p. 
But the power of p entering in my — vy is at most p”~?. Dividing numerator 
and denominator of each factor by my — v shows that 


my —v+ vq 


= * 
ea = | mod* p. 


This proves the theorem except when p = 2, when we know only that u,,(y) = 
+1. In this case we argue further as in [D-H]. 
Let / be a prime dividing q — 1. Let 
v= bib 


be the decomposition of w into a product of a character of /-power order, and 
a character of order prime to /. Then 


un = uryr 


is the corresponding decomposition for yw”. Let /” be the highest power of / 
dividing q — 1, and let €» be a primitive /’th root of unity. Let 


A=Cw —-— 1. 
Since yy, = 1 mod A, it follows that y = w, mod J. Therefore 
WwW) = (Wy) moddA and t(W,) = 1 mod A. 


In particular, 


Un(W) = Un(Wy) mod A and u,(y,) = 1 mod 4. 
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Since u,,(y) = +1, it follows that u(y) = 1, thereby proving the theorem. 


Remark. In [Ya], Yamamoto shows that the Gauss sums form the univer- 
sal odd distribution modulo 2-torsion. 


Appendix 


In this chapter we have looked at the distributions which are especially 
relevant to the cyclotomic theory discussed in the rest of the book. It is 
worthwhile to give here a number of examples of distributions occurring 
throughout mathematics, involving various classical objects. We make a list 
of a general nature, including those we have already met. 


(1) The Bernoulli distribution, which is essentially given by a polynomial. 


(2) The Fourier—Bernoulli distribution, giving rise to the Bernoulli distri- 
bution, as follows. For real 6 we have the Fourier expansion 


e2nind 


k! 
B.A<8>) = ~ GniyF >, ro 


Thus we may even define B, on R/Z, and through this Fourier series, the 
function given at level N by 
0 +> N¥-1BA<O)) 


satisfies the distribution relation. 


(3) The holomorphic Bernoulli distribution. Let 


nO 


and restrict z to the unit circle, z = e?”°. Then {N*~1f,} defines a distribution. 
The real part for k even and imaginary part for k odd are mere homomorphic 
images of this one, and give rise to the Bernoulli distribution of (2). 


(4) The partial zeta functions. Let 


~ 1 
CO 2am 


be the Hurwitz zeta function, for 0 < u < 1. For each real number f, let {t} 
be the unique number congruent to ¢ mod Z, and such that 


0<{t}<1. 
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Then for ae Z(M), the function 


valoda} 


satisfies the distribution relation, namely 


b a 
N-s c(s {5 +) - Mt(s, i7}) 
es N M 
The sum on the left is taken for 6 in Z(N) reducing to a mod M. 


(5) The gamma distribution. Define 
te 
V2n 
We view G as defined on Q/Z with the origin deleted, but then with values in 
the factor group 


G(z) = T(z). 


G: QZ — {0} > C*/Qz 


of the multiplicative group of complex numbers, modulo the multiplicative 
group of all algebraic numbers. The classical identity 


7 1 J —_ 1 4-Nz 
[] Tu r (< + i) iar I'(Nz)N 
shows that G defines a distribution. 

Rohrlich has conjectured that G is then the universal odd distribution, 
with values in groups where multiplication by 2 is invertible. This is a con- 
jecture in the theory of transcendental numbers. It also leads to the question 
(in algebraic independence) whether the distribution relations, the oddness 
relations and the functional equations generate an ideal of definition over the 
algebraic numbers for all algebraic relations among the values of the gamma 
function (1//2x)r, with rational arguments. 


(6) The cyclotomic units, which we have discussed. 


(7) The modular units, which may be defined by their g-expansions, 
namely 


g(a) = —q?B2ver*axe1-Y/2(1 — g,) | | (1 — qq.) — 9/42) 
n=l 


where a = (aj, a2) € Q?/Z? and a # (0, 0), where 
Z= At + Qo, 
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and where the value g(a) is to be taken in the multiplicative group of the 
modular function field modulo roots of unity, cf. [KL 3], following the work 
or Ramachandra and Robert. The association 


at-> g(a) 


is the universal even ordinary distribution on Q?/Z? — {0}. The ordinary 
Bernoulli distribution (with k = 2) then appears as a homomorphic image of 
this one. 

In the last three examples, the distribution is not defined at 0. In such cases, 
it is useful terminology to refer to the distribution as punctured. 

Roughly speaking, I expect that in any classical situation where a distribu- 
tion arises naturally, it is universal (odd, even, punctured, as the case may be), 
always subject to taking values in groups where 2 is invertible. 


(8) The Lobatchevski distribution. I am indebted to Milnor for the 


following brief comments which might inspire the reader. Define the 
Lobatchevski function 


7] 
1A ss i log|2 sin t| dt. 
i) 
This is essentially the same as the integral 
6 
-{ log|e2"#* — 1| dt. 
10) 


Since the function t+ |e?" — 1| satisfies the distribution relation, one sees 
at once that A(@) satisfies the distribution relation in the sense that on 
{(1/N)Z/Z} the family {NA(6)} is a distribution, which is odd. 

Let H be hyperbolic 3-space. This is the set of points 


(x1, X2, y)ER x R x R* 


so (x;, X2) is an ordinary point in the plane, and y > 0. We endow H with 
the metric 


dx? + dx2 + dy? 
eg : 


Select four distinct points in the plane, and let T be the tetrahedron in H 
whose vertices are at these points. Then it can be shown that opposite dihedral 
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angles are equal. (The dihedral angles are the angles between the faces of the 
tetrahedron.) Let a, B, y be the dihedral angles. Then 


at+pry=n, 


and the volume of the tetrahedron is precisely given in terms of the Lobatchev- 
ski function by 


[[[Ege® = vor = i) + a0) + 20 


The search for relations among such volumes had led Milnor to consider the 
Lobatchevski function and its relations, now known as distribution relations, 
and to show that it had the maximum rank (its values being viewed as con- 
tained in a vector space over the rationals). Of course, Kubert’s construction 
in fact gives free generators over Z. 

Finally, let 0 = Z[¢], where ¢ is a primitive cube root of unity. Then 


PSL,(0) < PSL,(C) = Aut A, 
where Aut H is the group of automorphisms for the Riemannian structure, 
orientation preserving. The tetrahedron is essentially a fundamental domain 


for PSL,(0). This point of view leads into the problem of determining all 
relations for volumes of fundamental domains in the higher dimensional case. 
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The complex analytic class number formulas date back to the 19th century. 
They relate class numbers of cyclotomic fields and units. They arise by 
factoring the zeta function of a cyclotomic field in L-series, and looking at the 
factorization of the residue. 


§1. Gauss Sums on Z/mZ 


We have to redo the properties developed in Chapter 1, for the ring with 
divisors of zero Z(m) = Z/mZ. The only additional feature arises from the 
presence of non-zero elements which are not units. We let m = J] p™ be 
the prime power product. We then have product decompositions 


Z(m) = ‘ai Z(p™) and Z(m)* = ial Z(p™)*, 


From the product, for any character y on Z(m)* and any character 2 on 
Z(m) we have a decomposition 


z~=[lxu and A=[ [4 
Pp Pp 


If xe Z(m) and x is not prime to m, we define y(x) = 0. We let € be a 
primitive mth root of unity (chosen to be e?"/" over the complex numbers), 
and 


Ax) = C*. 
Observe that Z(m) is self dual under the pairing 
(x, y) re OY. 
69 


3. Complex Analytic Class Number Formulas 


Let d|m. We have a natural surjective homomorphism 
Z(m) > Z(d) 
and also a surjective homomorphism 
Z(m)* — Z(d)*. 


If there does not exist d|m and d # m such that y factors through Z(d)*, then 
we call x primitive. Again to determine the smallest d such that a given 
character factors through Z(d)*, we may look at prime powers. 

Suppose m = p” is a prime power, and yx is a character on Z(p"). Let p’ 
be the smallest power of p such that y is trivial on 


1 + p’Z(p*). 
For convenience, let us abbreviate 
A = Z(p"), 


so 1 + p’A is a group for any positive integer v. The following criterion is 
immediate. 


x is primitive if and only ifr = n. 


The power p” = p™ is called the conductor of y. 
In the composite case, we let the conductor be defined by the product 


e(~) = cond(y) = I] pr. 


It is then clear that c(y) is the smallest d such that y factors through Z(d)*. 
We define 


SQ) = Sa) = > x0AQ0), 


and the sum could be taken only over those x € Z(m)*. It is then obvious that 
we have a decomposition 


Sx A) = ] Sete Ay) 


where the sum S, is taken over Z(p™)*. 
If d is an integer prime to m, then, as with Gauss sums over finite fields, 
we have 


S(x, dod) = x(d)S(x, A); 
by making the change of variables x +> d-1x. 
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On the other hand, if dis not prime to m, we have one new significant 
feature. 


Theorem 1.1. [fy is primitive and d is not prime to m, then 
S(y, Aed) = 0. 


Proof. Using the prime power decomposition, we may assume without loss 
of generality that m = p” is a prime power. Abbreviate 


A = Z(p"). 


Also without loss of generality, we may assume d = p’ for some integer 
r > 1,andr <n. Form a coset decomposition 


A* = |_Ju(l + p*-7A). 
Then 
SQ 4p) =D Dd xu + MAp"u) 


xepr— 


= > xu)A(p'u) > x1 + 2). 
i x 
Since x is assumed primitive, it is non-trivial on 1 + p"~'A, and the sum on 
the right is 0, thus proving the theorem. 


From here on we have the same formalism as for Gauss sums over finite 
fields. For any function f on Z(m) we define its Fourier transform 


Tf(y) = 2 SOM xy). 


Theorem 1.2. (i) We have T?f = mf~. 
(ii) If x is primitive, then 


Tx = x-1)S@x7?- 
(iii) Again if x is primitive, then 
SOOS) = m. 


Proof. Part (i) is proved as for the finite field case. For (ii), if y is not 
prime to m, then Ty(y) = 0 by Theorem 1.1. If y is prime to m then we can 
make the usual change of variables to get the right answer. Part (iii) is then 
proved as in the finite field case. 
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§2. Primitive L-series 


Let y be a character mod m. We consider the Dirichlet L-series for Re(s) > 1: 


Usy= >IO- 5 ss 


aeZ(m)* 


Let ¢ again be a primitive mth root of unity. Then we have 


1 pra-mx {) ifn = amodm 
™ x&zm) 1 ifn#amodm. 


Indeed, if a # n(mod m), then the character x +> £¢-™* is non-trivial on 
Z(m). Consequently we can write the L-series in the form 


1< 1 
Ls,y= > xD— > DGS 
aeZ(m) m n=1 x n 
whence also 


Gan 
oat, 


1 ao 
Ls) = 2, S04) DA 
xEZ(m) n=1 


Theorem 2.1. Assume that y is a primitive character mod m. Then 


1 
Ls, == SW > 
bEeZ(m)* 
Proof. If x is not prime to m then the Gauss sum is 0 by Theorem 1.1. 
If 5 is prime to m, we can make the change of variables which yields the 
desired expression. 


So far we have worked with Re(s) > 1. We now want to have the value 
of the L-series at s = 1. It is not difficult to prove that the L-series has an 
analytic continuation for Re(s) > 0. Of course, it is also known (and a little 
more involved) how to prove the analytic continuation to the whole complex 
plane. For our purposes, to get the value at 1, we can work ad hoc, let s be 
real >1, and take the limit as s approaches 1. Then we don’t need anything 
else here. 

We recall a lemma about series. 


Lemma. Let {a,} be a decreasing sequence of positive numbers, whose limit 
is 0 as n> 00. Let {b,} be a sequence of complex numbers, and assume that 
there is a number C > 0 such that for all n, 
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i.e., the partial sums of the series {b,} are bounded. Then the series > a,b, 
converges, and in fact 


n 
= AyD, 
Kaa 


The proof is immediate using summation by parts. 


< Cay. 


We apply the lemma to the series with b, = €~"° and a, = 1/n° with s 
real >0. The partial sums of the 5, are clearly bounded (they are periodic). 
Let 


Zo a me 1. 


For |z| < 1 we have 


A 
—log(l — z) = vee 


AS z—> Zo, —log(1 — z) approaches —log(1 — z,). On the other hand, let 
z=rz withhO<r<l. 


Then the series > z"/n converges to >, z§/n as z tends to z, along the ray 
(that is, r tends to 1). This is again obvious by estimating the tail end of the 
series using the lemma. Consequently, we find: 


Theorem 2.2. If x is a primitive character, then 


m 


Ll, ) = - 92S a) lost — 6°». 


The picture of the roots of unity looks like in the figure. 


If 


1-@= jl a Che, 
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then the picture shows that 
Lath? = |l— ¢-|e-*, 


The branch of the logarithm is determined so that 


Observe that we do not change the sum 


> x6) log — &) 
if we replace b with —b. We shall distinguish two cases. 
We say that x is even if y(—1) = 1, and that y is odd if y(—1) = —1. 


We assume m > 2, and m = m(y) is the conductor of x. 


Case 1. x is even. 


In this case, adding the sum with b and —b yields 
2D, X(6) log(1 — €-») = >! HO[log(1 — ¢°) + log( — ¢-*)]. 


With y even, we obtain the formula 


L(1, ) =-SP S40) loglt - © 


Case 2. x is odd. 


In this case, we let 
C=e"lm and b=1,...,m—1. 
Then 


log(l ~ £-%) = log|t — £-*| + 4(§ - 2) 


log(1 — ¢°) = log|1 — C°| — i(5 = =). 


Thus with x odd, we obtain the formula 


2 
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Remark. Let m be an integer >1 and let y be a non-trivial character on 
Z(m)*. Then either the conductor of y is odd, or it is even, in which case it is 
divisible by 4. Hence for a primitive character, we cannot have m = 2. 

This is in line with a field theoretic property. Consider the field 


K = QUUn)- 


Let m be the smallest positive integer for which we can write K in this fashion. 
Then either m is odd or m is divisible by 4. If m is odd, then the group of 
roots of unity ux, in K consists of +y,. If m is even, then ux = Un. 


§3. Decomposition of L-series 


For the applications we have in mind, we have to deal with two types of 
fields: The cyclotomic field Q(u,,) for some integer m > 2, and its maximal 
real subfield, over which it is of degree 2. We shall use a language which 
applies to the more general situation of an arbitrary abelian extension of the 
rationals (known to be contained in a cyclotomic field), but the reader may 
limit his attention to the two cases mentioned above. Certain proofs can be 
given ad hoc in these cases, while it is easiest to use general class field theory 
to deal with the general situation. I hope that the extent to which I recall 
certain proofs here will make the material readable to any reader not 
acquainted with class field theory. 

Let K therefore be an abelian extension of Q, and let K* be its real 
subfield. We let m be the smallest positive integer such that K < Q(u,) (we 
call m the conductor of K). We assume K # Q, and as said above, you may 
assume K = Q(u,,) or K = Q(u,,)*. We have a surjective homomorphism 


Z(m)* + Gal(K/Q) = Gxia 


Any character y of Gxjq gives rise to a character on Z(m)*, also denoted by y. 
We let m(y) be its conductor. We may view y as factored through Z(m(y))*, 
in which case we speak of y as the corresponding primitive character. If we 
need to make a distinction between y as character on Z(m)* or the corre- 
sponding primitive character on Z(m(y))*, then we denote this primitive 
character by y . The context should always make clear which is meant. 

Let 


ols) =TT (1- x) 


be the zeta function associated with K. It is a fact that there is a decom- 
position 


tx(s) = | [Los, », 
x 
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where the product is taken over all the primitive characters induced by 
the characters of Gxjg. We reproduce the proof in the case K = Q(u,,). In the 
last section we dealt with the Z-series in its additive form. Here we use the 
multiplicative form 


L(s, x) = ial (1 - ap 


where the product is taken over all primes p not dividing m(y). All these series 
and products converge absolutely for Re(s) > 1, and what is to be proved 
amounts to formal identities, localized at each prime p. Specifically, the 
decomposition is equivalent to proving for each prime number p: 


THe) “110-2 


It is therefore convenient to let t = p~*’. As usual, let 
(p) = (P1-+-r)*, Np = p’ 
be the decomposition of p in prime ideals in K. Then 
efr = [K: Q]. 
The identity to be shown is then equivalent to 


a —y =] [Gd — xp). 


Suppose first that p{m. Then e = 1. The prime p generates a cyclic subgroup 
of order fin Z(m)*, 


Z(m)* > {p} > {1}. 


The value of a character y viewed as character on Z(m)* or as primitive 
character are the same on p. There are f distinct characters on the cyclic 
group {p}, corresponding to the fth roots of unity, each such character 
assigning one of these roots of unity to p. Each one of these characters then 
extends in r possible ways to Z(m)*. Since trivially we have the factorization 


1-07 =|[]a—- 2), 


Sat 


we have proved our identity in the case ptm. The argument is, by the way, 
entirely similar if K # Q(u,). 
Suppose secondly that p|m. Write m = p*m’ with (p, m’) = 1. If p|m(y) 
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then by definition y(p) = 0. If p{m(y) then y factors through Z(m’)*. We are 
therefore reduced to proving the identity 


a-##y=[[a-xwp 


where the product is taken over those y whose conductor m(y) is not divisible 
by p, and hence such that y factors through Z(m’)*. The arguments are then 
identical with the preceding arguments, replacing m by m’. This concludes 
the proof. (Cf. [L 1], Chapter XII, §1.) 


As usual, we let r1, 72 be the number of real and complex conjugate 
embeddings of K. 


If K is real then r, = [K: Q], r. = 0. 
If K is not real, then r, = O and r, = 3[K: Q]. 
We let 


We assume known the analytic continuation of the zeta function and 
L-series at 1 (cf. [L 1], Chapter VU, [L 3], Chapter XIV). By comparing 
residues, we have the class number formula: 


CNF. Ce I] L(A, 2). 


diz 


As usual: 


w = Wx = number of roots of unity in K. 
h = hx = class number of K. 
R = Rx = regulator of K. 


d = dx = absolute value of the discriminant. 


If K is real, so rg = 0, then w = 2 and the formula reads: 


a =[] 320, ». 


Leopoldt’s p-adic analogue will be given in the next chapter. If K is not real, 
then we let h+, R* denote the class number and regulator of its real subfield 
and N* is the degree of the real subfield, 


N* = N/2 = 1p. 
We shall also need another fact whose proof is somewhat more delicate. 
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Theorem 3.1. We have product expressions: 


(i) [T]m@ =4 
X#1 
: qil2 if K is real 
S =. 
(ii) I] @) oa if K is not real. 


Proof. It is possible to give essentially algebraic proofs for these facts 
(although the sign of the Gauss sums is always a little delicate, involving 
something about the complex numbers). The best way to see the theorem, 
however, is probably as in Hasse [Ha 1], using the functional equations of the 
zeta function and L-series. Indeed, under the change s+> 1 — s, the functions 


d5!2(7—$!2T°(5/2))*Cx(s) if K is real 


1+ 5\\%2 : : : 
avn(a-vr(gi2r (xr (4) €.(s) if K imaginary 


are invariant. On the other hand, under the transformation 
St>1l—s and yr, 
the following functions (for non-trivial y) 
m(0°'*(n-*7I(s/2)) L(s, x) if y is even 


my)? (xr (+44) )z6, x) if xis odd 


take on the factor 
Vx(— 1m) 
SW 


Dividing the functional equation of the zeta function by the functional 
equation of the L-series, one sees that under s+>1 — s, 


(=@)" takes on the factor Vid mt, 


The theorem then follows at once. 


If we combine the residue formula, Theorem 3.1, and the expressions for 
the values L(1, x) for primitive characters y found in the last section, we then 
get the following factorizations for the product /R in the two cases. 


K real. 
2-1“AR=[] > —x(6)log|l — Crop! 


x#1 bmod m(x) 
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Warning: In this case, N = N*,h = h*, R = R* and characters are even. 


K imaginary. 


22 AR 
= II > — x(b) log|1 — Crayl - | | Bix 
0 


In the real case, we observe that all characters are even. Also the number of 
roots of unity in K when K is real is equal to 2. Otherwise, the formulas are 
just obtained by plugging in. 

It will be convenient to reformulate them slightly, to make the connection 
between imaginary K and the maximal real subfield clearer. We let: 


E = Ex = group of units in K 
E* = Ex+ = group of units in Kt 
Hx = group of roots of unity in K 


x = group of ideal classes in K. 


Lemma. We have the index 


QNI2)-1 R+ 


. PN canes 
(E: peE*) = R 


Proof. This is obvious by computing the regulator of the units in K* with 
respect to K, where local factors of 2 occur in each row of the determinant 
expressing the regulator, whereas a local factor of 1 occurs in the corre- 
sponding determinant giving the regulator of the units in K*. 


Following Hasse, we give a symbol for the index in the lemma, calling it 
the unit index: 


Ox = Q = (E: uxE*). 


Reading the class number formula in the real case applied to K*, we find: 


Theorem 3.2. For imaginary K, 


h=h*Qw2-"? 7] -By. 


yodd 
In the next section, we shall analyze more closely the decomposition 
h=hth-, 
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where h~ is defined as h/h*, and we shall see that A~ is an integer. In any 
case, we have the class number formula: 


CNF-. h- = Qw[ | —-43B,,. 
zyodd 


In the next section, we shall prove that Q = 1 if K = Q(u,,) and mis a 
prime power. In addition, #~ will be interpreted as the order of the (—1)- 
eigenspace of the ideal class group. From Theorem 1.1 of Chapter 2, we find: 


Theorem 3.3. [f m is a prime power, K = Q(Um), G = Gal(K/Q), and FS is 
the Stickelberger ideal, then 


h- = (Z[G]~: FY). 


Let p be a prime number. If A is an abelian group, we denote by A its 
p-primary part. As Iwasawa observed [Iw 7], knowing the index immediately 
shows that: 


The group Cx is generated by one element over Z[G] if and only if there 
is a Z[G]-isomorphism 


Ce & (ZIGH/F). 


Indeed, we know that the Stickelberger ideal annihilates the ideal classes, 
so the isomorphism is obvious if there exists one generating element by 
Theorem 1.1 of Chapter 2. 

Let m = p itself. Iwasawa [Iw 7] and Leopoldt [Le 5], [Le 10] have shown 
that if the Kummer—Vandiver conjecture h* prime to p is true, then the 
cyclicity follows for the p-primary part of C™. (See Chapter 6, §4.) Prov- 
ing the Kummer—Vandiver conjecture, or the Iwasawa—Leopoldt conjec- 
ture that Cx is cyclic over the group ring is therefore one of the major 
problems of algebraic number theory today. 

In the Iwasawa-Leopoldt conjecture it is necessary in general to restrict 
the conjecture to the p-primary component. For example, let F be an imaginary 
quadratic field Q(/—p), and suppose p is such that F is contained in the 
cyclotomic field Q(u,) = K. Then K over F is totally ramified above p, and 
the Hilbert class field of F lifts to an unramified extension of K of the same 
degree, so the ideal class group C; is a factor group of the ideal class group 
Cx, and Cp = C,~. It is known that there exist such fields, e.g., Q(V — 3299), 
for which C, contains a group of type (3, 3), see Scholz-Taussky [S-T]. 
Furthermore, 3 does not divide p — 1, with p = 3299. Consequently all the 
non-trivial eigenspaces for characters of Z(p)* of the local ring group Z,[G] 
are cyclic over Zs. This shows that there cannot be an isomorphism 


Cx” & ZG]-/A, 
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and thus in general, the Iwasawa—Leopoldt conjecture has to be restricted to 
the p-primary component. 
For h* we also get a formula, and it is convenient to introduce the group 


G = Z(m)*/+1, 
and for each even character x the group 
G, = Zim))*/ +1. 


Then there are exactly N/2 even characters and (N/2) — 1 non-trivial even 
characters. Therefore we obtain the other class number formula: 


1 
CNF*. h* = perl] Dd —2@ log — Gua. 


x 


x1#1 bEG 


The product over y # 1 is taken over the non-trivial characters of G,, or 
equivalently the non-trivial even characters of Z(m)*. This product will be 
interpreted as a determinant of certain units in §5, and it will follow that A+ 
is equal to the index of a certain subgroup of the units in the group of all 
units. 


§4. The (+ 1)-eigenspaces 


In this section we analyze in greater detail the factors h* and h~ of the 
class number, and the corresponding unit index. We assume that m is odd 
or m = 0 mod 4. 


Theorem 4.1. Let K = Q(u,). Then Qx = 1 if m is a prime power, and 2 
if m is not a prime power. 


Proof. Let E = Ex be the unit group in K. For each unit u in E, the 
quotient u/u is a unit, of absolute value 1, and for any automorphism o of 
K over Q, we have 


o(@/u) = oujou 


because o commutes with complex conjugation (abelian Galois group). 
Hence all conjugates of ii/u have absolute value 1. Hence i#/u is a root of 
unity. Let 


pg: E> = bx 
be the homomorphism g(u) = ii/u. Then 
WS @(E)C p, 
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because if wu is a root of unity, then g(u) = u~? so the image of @ contains 
the squares. Hence the index of g(£) in pis 1 or 2 because yp is cyclic. Further- 
more, we see at once: 


Ox = 2 if and only if ¢ is surjective, i.e., P(E) = uy. 


Assume that m is composite. Let be a generator of y if m is even, and a 
generator of the odd part of y if m is odd. Then 1 — € is a unit (elementary 
fact, and easy exercise), and g(1 — €) = —€71, so g(E) = uy, in other words 
g is surjective. On the other hand, g(E£* py) = p, so the index is 2 in this case. 

Suppose next that m is a prime power, m = p". We contend that g(E£) # wp. 
It will follow that g(£) = p?, and since the kernel of g is E* the theorem also 
follows in this case. Suppose g(£) = yw. Let € be a primitive mth root of 
unity, and let uw be a unit such that 


ju = —C7}, 
Let 
1—- C 


u 


“gz 


Then «/~ = 1 so « = & and a is real. But 1 — € is a prime element above p 
in K and so « is also a prime element, which cannot lie in the real subfield. 
This proves the theorem. 


Theorem 4.2. Let K = Q(u,). The natural map 
Cxt+ > Cx 


of ideal classes in K* into the ideal class group of K is injective. 


Proof. Let a be an ideal of K* and suppose a = (a) with « in K. Then 
&/a is a unit, and in fact a root of unity as one sees by an argument similar 
to that in Theorem 4.1. Suppose that m is composite. By Theorem 4.1, we 
know that Q, = 2 and @ is surjective, so there exists a unit wu such that 


uj = a/a. 


Then au is real, and generates the same ideal as a, thus proving the theorem 
in this case. 

Suppose that m = p” is a prime power. Let ¢ be a primitive mth root of 
unity, and let 2 = 1 — ¢, so A is a prime element above p in K. We can write 
& = az with some root of unity, and 4/A = —¢ is a generator of uw. Hence 


z= (A/a 


for some positive integer k. Then a/* is real. Since the ideal generated by « 
comes from Kt, and since p is totally ramified in K, it follows that k is even. 
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Hence z is a square in yp, and therefore in the image of @, say z = u/u for 
some unit uv. Then aw is real, and generates the same ideal as «, thus proving 
that a is principal, and also proving the theorem. 


Theorems 4.1 and 4.2 are classical, see for instance Hasse [Ha 1], Chapter 3, 
for more general results. The elegant proofs given here are due to Iwasawa. 


Theorem 4.3. Let K be an imaginary abelian extension of Q. Then the norm 
map 


Nxjxt: Cy > Ct 


on the ideal class group is surjective. 


Proof. We have to use class field theory, which gives the more general 
statement: 


Lemma. Let K be an abelian extension of a number field F. Let H be the 
Hilbert class field of F (maximal abelian unramified extension of F). If 
KOH = F then the norm map Nxip: Cx —> Cf is surjective. 


Proof. For any ideal class c in K, the properties of the Artin symbol show 
that 


(c, KH/K) restricted to H = (Nx) pc, H/F). 


We have natural isomorphisms of Galois groups: 


ne 
ie 


KQNH=F 


Gal(KH/K) ~ Gal(H/F) K 


Hence the group (Nx;rCx, H/F) is the whole Galois group Gal(H/F), whence 
NxirCx = Cp since the Artin symbol gives an isomorphism of the ideal class 
group with the Galois group. This proves the lemma. 


The theorem follows at once, because K over K* is ramified at the 
archimedean primes, and hence cannot intersect the Hilbert class field of F 
except in F. 

Let t denote complex conjugation. Let 


Cz = (—1)-eigenspace of Cx 
= {ce € Cy such that c*** = 1}. 
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Theorem 4.4. Let K = Q(un). Then the sequence 


1—> Cz > Ce > Cet > 1 


is exact. 


Proof. We consider the norm map followed by the injection, 
Ce norm Crt inj Ce 


The kernel of this composite map is Cg by definition, so the theorem is 
obvious by what had already been proved. 


Corollary. The quotient h/h* is an integer, which is the order of the group 
Cx. 


Remark. The integer /~ is called the first factor, and h* is called the second 
factor of the class number, in older literature. This is poor terminology since 
the ordering seems arbitrary, and for several years this has been replaced by 
the plus and minus terminology. 


§5. Cyclotomic Units 


Let m again be the conductor of the cyclotomic field Q(u,,), so either m is 
odd >1 or m is divisible by 4. Let € be a primitive mth root of unity. For b 
prime to m we let 


Then g, is a unit called a cyclotomic unit. It is easy to see that g, is equal to a 
real unit times a root of unity. Indeed, without loss of generality we may 
assume that b is odd, since ¢° depends only on the residue class of b mod m. 
Then 


“ b 
C-’g, for Ve a 


is real (i.e., fixed under g_), as one sees immediately from the definitions. 
We let g; be this real unit, uniquely determined up to sign, and call it the 
real cyclotomic unit. 

We let & be the group of units in Q(u,,) generated by the roots of unity 
and the cyclotomic units. We let &* be the group of units in Q(u,,)* generated 
by +1 and the real cyclotomic units. Then 


E/E x E*/é*. 
Observe that g, and g_, differ by a root of unity. 
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As before, let N = [Q(u,) : Q] and let 


N 


r= 3 - 1. 
Then r is the rank of E, and also the rank of E*. If ¢,..., ¢, is a basis for 
E+ (mod roots of unity), then the regulator R* is the absolute value of the 
determinant 

R(E) = Rt = £ det log|oae;| 
a, 

where j = 1,..., rand ae Z(m)*/+1 and a # +1 (mod m). It is convenient 
to let 


G = Z(m)*/+1 


so we may view ae€G,a# 1inG. 
On the other hand, we may form the cyclotomic regulator 


R(€) = Rec = + det logloag,| 
a,ob#1 


again with a,beG, and of course it does not matter if we write g, or gf 
since the absolute value of a root of unity is 1. 

For composite levels m the cyclotomic units are not necessarily inde- 
pendent, and so we now turn to prime power level, 


m= p". 


We shall prove in this case that the cyclotomic units are independent. 
Interpreting the regulator as the volume of a fundamental domain for the 
lattice generated by the log vectors of units in R’, we see that 


(E:&) = (E* 1 €*) = Reyo[R*. 


Remark. For composite m, as with the index of the Stickelberger ideal, 
it is necessary to consider the group generated by cyclotomic units of all 
intermediate levels to get a group of units of the right rank. 


Theorem 5.1. Let K = Q(u,,) andh = hx. Assume m = p" is a prime power. 
Then 


ht = (Et: @*) =(E:@). 
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Proof. Let G be any finite abelian group. Then we have the Frobenius 
determinant formula for any function f on G: 


[1 > x@fa-») = det [f(ab-4) — f@)}. 


Z#1 aeG 


The proof will be recalled later for the convenience of the reader. It is already 
clear that up to minor changes, this formula yields the theorem, taking into 
account the expression for h* obtained at the end of §3. We now make these 
changes explicit. 


Lemma 1. We have for G = Z(m)*/+1: 
+ det logloag,| = Il > x(b) log|1 — C| 
ab#1 X#1 bEG 
=[ 1] > x bogie. 
X#1 bEG 


Proof. The first expression comes from the Frobenius determinant formula 
(Theorem 6.2), and the second comes from the fact that for non-trivial y, 


> x(6) log|1 — ¢| = 0. 
Lemma 2. Let G, = Z(m(xz))*/+1. For prime power m = p", we have 
>, xB) log|1 — Chal = > x(6) log|1 — Gl. 
bEeGy beG 
Proof. Let m(x) = p’. We write residue classes in Z(p")* in the form 
y=6+ psc, withO <c < p™-5, 


and b ranges over a fixed set of representatives for residue classes of Z(p')*. 
Instead of the sums over G, and G respectively, it is easier now to work with 
sums over Z(p*)* and Z(p")* respectively, and then divide by 2. The desired 
relation is then immediate from the identity 


[] -4Y) = xm - ¥*, 


Am=1 


because we get 


2, H2) logl1 — Gel =D (6) log|l — 


y mod p' 
This proves the lemma. 


Theorem 5.1 is then immediate from the lemmas, and the class number 
formula for h* obtained from the L-series. 
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It is generally believed that the coincidence of group orders in Theorem 5.1 
does not correspond to an isomorphism of the groups involved. Iwasawa has 
a counterexample at least that C+ is not isomorphic to E/€ as Galois module. 
Mazur has pointed out that the analogous statement for the case of elliptic 
curves with complex multiplication is definitely false. 

We conclude this section by mentioning the most classical case of the 
quadratic subfield. For our purposes we are interested in the case of the real 
quadratic subfield. Thus for the end of this section, we let 


m =p with p prime # 2, 3 
and such that K = Q(u,,) contains a real subfield F = Q(vD) with D > 0, 


so D = p, and D is the discriminant. Let ¢ > 1 be a fundamental unit of F, 
and h,; the class number. From 


Cr(s) = Ca(s)L(s, x) 
where x has order 2, we get 


2h, log € 


= L(1, x) 


S 


= - 72 H@) logll ~ ¢. 


It is a simple matter of the theory of quadratic fields that the conductor m(y) 
is exactly D (assumed > 0). The explicit value S(y) can be determined in any 
number of ways (via functional equation, via Dirichlet’s method as in my 
Algebraic Number Theory, Chapter IV, §3, etc.), and we have S(y) = VD. 
Thus we find: 


Theorem 5.2. For a real quadratic field F = Q(V D) as above, 
hy loge = — > x(a) log|1 — CI. 
D 


amod 


We have the tower of fields: 


Z(D)*/+1 


O——_»— 
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Let & be the group of cyclotomic units in K* and let & be + its norm group 
into F, so 


by = + Nxt pe. 


Then &, (mod + 1) is infinite cyclic. 


Theorem 5.3. hy = (Ey : &). 
Proof. Let 
a= [] (-¢) and o = [[ -&, 
x(ay=1 x(ay=-1 


where ¢ is a fixed primitive Dth root of unity. Note that the character x is 
even, so a and —a occur simultaneously in each product. Therefore the norm 
from K* to F of any real cyclotomic unit 


1—¢ 
ee er 


is a unit in F, and the group generated by these norms (mod + 1) is infinite 
cyclic, generated by a unit 7 > O such that 


tn? = a'/a. 
From Theorem 5.2 we conclude that 
hy log ¢ = log y. 


Thus y = e”, and since 7 (mod +1) generates the norms of cyclotomic units 
in K*, this proves the index relation of Theorem 5.3. 


This index relation is analogous to that of Theorem 5.1 for the full cyclo- 
tomic field. Since K* is totally ramified over F (at the prime p) it follows from 
class field theory that 


hy divides hz. 
(Proof: Let Hy be the Hilbert class field of F. Then Hy N K*+ = F, so 
[H,-Kt : Kt) ae [Hy : F] = hy. 


Since H,;K*/K* is unramified, it follows by class field theory that h, divides 
hg.) 


For tables of some h,, see Borevich—Shafarevich, Number Theory, Aca- 
demic Press, p. 424. It has been observed for a long time that h, has very small 
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values, and grows very slowly. It is unknown if there are infinitely many real 
quadratic fields of class number 1. 


§6. The Dedekind Determinant 


Let G be a finite abelian group and G = {y} its character group. We have the 
Dedekind determinant relation: 


Theorem 6.1. Let f be any (complex valued) function on G. Then 


[1 > x@fte-*) = det f(a-*8). 


xuEG aeG 


Proof. Let F be the space of functions on G. It is a finite dimensional vector 
space whose dimension is the order of G. It has two natural bases. First, the 
characters {y}, and second the functions {65,}, b € G, where 


6(x)=1 if x=b 
6(x=0 if x #5. 


For each ae G let T,f be the function such that T, f(x) = f(ax). Then 
(Tax)(b) = x(ab) = x(a)x(6), 
so that 
Tax = x(a)x- 


So x is an eigenvector of T,. Let 


T= > fey. 


aeG 


Then 7 is a linear map on F, and for each character x, we have 
Tx = [> x@fa-)|x. 
aeG 


Therefore y is an eigenvector of T, and consequently the determinant of T is 
equal to the product over all y occurring on the left-hand side of the equality 
in Theorem 6.1. 

On the other hand, we look at the effect of T on the other basis. We have 


T,6(x) = 6,(ax), 
so that 7,6, is the characteristic function of a~1b, and 
T. 2b = 6a-- 
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Consequently 


Td, = > fla-)a-% 


aeG 


= > fla-b)oq. 


aéeG 


From this we find an expression for the determinant of T which is precisely 
the right-hand side in Theorem 4.1. This proves our theorem. 


Theorem 6.2. The determinant in Theorem 4.1 splits into 
det f(ab-1) = [> fa) det [f(ab-+) — f(a)]. 
a,b aeG a,d#1 

Therefore 


[1 > x@f@e-) = det (ab) — fla)}. 


2#1aEG 


Proof. Let a, = 1,..., a, be the elements of G. In the determinant 


f(aiazr*) flaaz*)- + -f(aaz*) 
det f(a,ajz4) = : : 
S(anazr*) f(anaz*)-- fanaa *) 


add the last n — 1 rows to the first. Then all elements of the new first row are 
equal to > f(a~*) = > f(a). Factoring this out yields 


1 Fo ox. 


[> fa| Lae aes : Seas) | 


aeG 


f(Qnar*)  f(Qnaz*)- + -f(@nan*) 


Recall that a, is chosen to be 1. Subtract the first column from each one of the 
other columns. You get the first statement. 

On the other hand, the function f can be selected so that the elements 
{f(@}, a € G, are algebraically independent over Q, and therefore the factori- 
zation given in this first statement for the determinant is applicable in the 
polynomial ring generated over Z by the variables f(a). Combining the first 
statement with Theorem 6.1 yields the second relation where the product is 
taken only over y # 1. 


Serre has pointed out to me that the determinant relation is due to Dede- 
kind, February 1896, who communicated it to Frobenius in March. Cf. 
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T. Hawkins, ‘‘New light on Frobenius...”, Archive for History of Exact 
Sciences 12 (1974), p. 223. 


§7. Bounds for Class Numbers 


In this book we have not emphasized questions having to do with the size 
of the class number. We shall here make some brief remarks concerning 
various possibilities to obtain bounds. We let A,, = class number of Q(u,,), 
and p is prime > 3. 

To begin we derive the expression of the class number A; as a determinant 
following Carlitz-Olson [Ca—O]. We start with the expression 


hy = 2p ial —4B,,, 


zyodad 


=I] -+_5 x05», 


because the characters are non-trivial, and the term with 4 drops out. We try 
to rewrite this as a Dedekind determinant over the group 


G = Z(p)*/+1. 


>t 

14 P 

Let w be the Teichmuller character such that w(a) = a mod p. We write odd 
characters as products 


We have 


xX = op 


where w is even. Then we find 


1 a —a 
p = 2p id 5 2, Vaa(a)( <5 ; 
and this makes sense because the function 


fla) = o(a(¢ 2 _ (2%) 


on Z(p)* is actually well defined mod + 1, so is defined on G. This expression 
is now in the form where we can apply the Dedekind determinant, thus getting 


hs = aon det| tad (2 - C=2)], 
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The size of the determinant is 


p-1 
2 


Let ¢ be a primitive (p — 1)-root of unity. Representatives for elements of G 
are given by the powers ¢‘ with 1 < i < (p — 1)/2. The determinant may then 
be taken over indices 
hj=1,...,254 witha=t,5=0, 

and w(ab) = C'*/. In the expansion of the determinant, every term contains a 
factor arising from these ¢‘*’, whose product is obviously 1. Consequently 
the determinant is the same as the determinant obtained by omitting these 
¢'*7 from each term. 

Let R(a) be the smallest positive integer in the residue class of a mod p. 
Then R(a) is an integer < p — 1, and 


2-80 


We use the notation R(¢'*+*) and R(—{¢‘*’) to denote similarly the smallest 
positive integers in the residue class of ¢'+7 and —¢'*/ respectively. Then we 
have proved the following theorem. 


Theorem 7.1. + Dy = (2p)?-?hS 
where 
p = det[R(C**”) — R(—C*4)]. 
Observe that each entry in the determinant D, is an integer of absolute 
value < p — 1. 
The absolute value of the determinant is the volume of the fundamental 
domain of its row vectors, say. This volume is bounded by the product of the 


Euclidean lengths of these vectors (Hadamard inequality). Carlitz [Ca] 
observed that this gives the bound 


hs < 27 GP- D4 = 1)?+9/4, 


As Carlitz—Olson relate it, the history of the determinant in Theorem 7.1 
is amusing. The determinant 


det R(ab-1), a,b = 1,..., pet 


was known classically as the Maillet determinant, conjectured to be 40 by 
Maillet. Malo computed it for p < 13, and found it equal to the appropriate 
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power of p. He conjectured that it was always so equal, but Carlitz-Olson 
computed a bit further, and found extra factors. They derived that the 
Maillet determinant is equal to the determinant of Theorem 7.1 (up to the 
obvious power of 2), and then also to the class number times that power of 
p by using the expression of the class number as a product of generalized 
Bernoulli numbers (not called that at the time). Thus Malo had missed out 
the class number factor. 

Metsankyla [Me] gives growth estimates for h”. Masley-Montgomery 
[M—M] also prove the inequalities 


(22) = P12 py(P — 25)/4 < hs < (22) = P12 p(P + 31/4. 


for primes p > 200. Thus the Carlitz bound is reasonably sharp. For applica- 
tions of this see Ribet [Ri]. 

For primes p, it has been proved by Uchida [Uch] that h, = 1 if and only 
if p < 19. More generally, Masley and Montgomery [M—M] subsequently 
proved that h,, = 1 for precisely 29 distinct values of m (always assumed 
#2 mod 4), the largest of which ism = 84. Masley [Mas 2] shows that h,, = 2 
if and only if m = 39, 56. For Euclidean cyclotomic fields, see also [Mas 1]. 
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In this chapter we return to p-adic integration theory, and give Mazur’s 
formulation of the p-adic L-function as Mellin transform. It turns out to be 
more convenient as a basic definition, than Iwasawa’s previous formulation 
in terms of power series. The connection is made via Example 2 of §1. We 
derive further analytic properties, which allow us to make explicit its value 
at s = 1, thereby obtaining Leopoldt’s formula in the p-adic case, analogous 
to that of the complex case. We also give Leopoldt’s version of the p-adic 
class number formula and regulator. 

The basic arguments are due to Leopoldt [Le 11]. However, we shall 
follow in §1 and §2 a course of Katz, which developed systematically opera- 
tions on measures and their corresponding formulation on power series in the 
Iwasawa algebra. In this manner, constructions which appear slightly tricky 
in Leopoldt’s paper here become completely natural, and even forced from 
these measure theoretic operations. 

The Leopoldt transform then appears as an extension of an integral 
transform to a somewhat wider class of power series than those with p-adic 
integral coefficients. No use will be made of this, since only integral valued 
measures occur in the analysis of the p-adic L-function, but we include 
Leopoldt’s results for completeness, for convenience of reference if the need 
ever arises for them. 

The p-adic L-function in the case of elliptic curves is discussed in Robert 
[Ro], and especially Coates—Wiles [C-W 2], [C-W 3]. See also Lichtenbaum 
[Li 3], and Katz [Ka] for general comments concerning its connection with 
formal groups. For the case of totally real fields, Shintani’s evaluation of the 
zeta function [Sh] presumably allows a development of the L-function similar 
to that of the cyclotomic case. 

This chapter is used only in Chapter 7, and it can therefore be omitted 
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without loss of the logical connections. On the other hand, if one leaves out 
the section on the p-adic regulator, then the chapter appears as a natural 
continuation of Chapter 2, and is essentially measure theoretic, independent 
of Chapter 3. 

Throughout, we need the fact that if o is the ring of integers in a p-adic 
field, then there is a natural isomorphism 


lim o[XV(1 + X)" — 1) x o[ X]]. 
<— 


The limit is the projective limit, and is called the Iwasawa algebra. This is a 
basic fact of algebra. In the next chapter, we need further facts about this 
algebra and modules over it. For the convenience of the reader, all these 
facts and their proofs will be placed in the next chapter. 


§1. Measures and Power Series 


Let C, be the completion of the algebraic closure of Q,, and let 0 = 0¢, 
be the ring of p-integers in C,. By a measure y we shall mean an 0-valued 
distribution on Z,. This means that for each integer n > O we have a function 


Hy: Z(p") > 9 


such that the family {u,} is a distribution on the projective system Z(p"). 

Let Cont(Z,, 0) or C(Z,, 9) be the space of continuous functions on Z, 
into 0, with sup norm. As usual, there is a bijection between measures and 
bounded functionals 


A: Cont(Z,, 0) > 0. 
[A Z,-linear map 4 is called bounded if there exists C > 0 such that 
|A(p)| < Clle|| for all  e Cont(Z,, 0). 


The inf of such C is called the norm of A, and denoted by ||A||. The bounded 
functionals form a p-adic space.] Indeed, it is clear that any measure p gives 
rise to a functional 


du: @ >| du. 


On the other hand, suppose 4 is a bounded functional. If x € Z(p”), let 9, 
be the characteristic function of the set of elements y € Z, such that 


y = xmod p". 
Define 
a(x) = A(oz)- 
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It is then clear that {u,} defines a measure. Since any continuous function 
on Z, can be uniformly approximated by step functions, it follows easily 


that the correspondence 
Hr> du 


is a bijection from o-valued measures on Z, to bounded functionals. 
Furthermore, define the norm 
ea = sup [Ha(x)|, 
Ns® 


taken for x € Z(p”) and all n. Then the map pt dy is easily verified to be 


norm preserving. 
The Iwasawa algebra is obtained as the projective limit 


Ap = lim o[X]/(1 + X)” — 1) & of[[X]], 
and 
of XY + xX)" — 1) = ofTY(T™ — 1) 
where T = 1 + X. Let y, = T mod (T”” — 1), so y2” = 1. Let as usual 


(;) Pes ae oo ae k +1) 


The function yu, on Z(p") can be viewed as an element of the group algebra 


o[y,], namely 


pha p-1 pea 
>, wale => sal) > (7) 4 
> (3 mo(,))* 


where the right-hand side is read mod (1 + X)°" — 1. Thus 


p™-1 p™-1 
D> moi = > coaX* = P(X), 
r=0 k=0 
where the coefficients c, , are given by 
pr=1 r 
Cae > wat; 
r=0 


The canonical homomorphism Z(p"**) + Z(p") maps 


Pn+i(X)+> P,(X), 
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and we let 


P(X) = lim P,(X) 


be the projective limit of these elements in the Iwasawa algebra. We call 
P(X) the power series associated to y, and also denote it by (Pu)(X) or 
P(X). Thus 


P: o-valued measures on Z, — o[[X]] 


is an o-linear map. Conversely, any power series fe o[[X]] defines a com- 
patible system of elements in the group algebras o[y,], so the map P is 
bijective. We write 


f=Pu or p=py 


to mean that f is the power series associated to as above. We call P the 
Iwasawa isomorphism. 
For any xe Z, let 


Gig x(x — 1). HG kl) _ (;): 


Since C,(r) is an integer for any positive integer r, and since Z* is dense in 
Z,, it follows that 


rane Amen oy 


is a polynomial map of Z, into itself, and in particular is continuous. 
For fixed n, define 


rr—1)---r-k+1 
C(x) = a a 
where 0 <r < p"— 1, and r= xmodp*. Then Cf is a step function, 
defined at level n, and 


lim C@® = C, uniformly. 


n7o 


Since the coefficients c,,, in the polynomial P,(X) are given by the sum of 
products of y,, and the binomial coefficient, we obtain: 


Theorem 1.1. Let f(X) = > ¢,X* € o[[X]]. Then 


Cy = [. () du,(x). 
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Theorem 1.2. The power series Pu is the unique power series f such that for 
z in the maximal ideal of 0, we have 


(1 + z)* du(x) = f(z). 


Zp 


Proof. We have 


I. (1 + 2)* du(x) = I. > (Z)= i. 


We can interchange the sum and integral, apply Theorem 1.1, and we see 
that Pu has the desired property. Uniqueness is obvious since any power series 
is determined by its values. 


Example 1. Let y be the Dirac measure at a point se Z,, that is 
) 9 du = 9s). 
Zp 
Then the associated power series f is 
= [s 
f= > ( )x = (1 + X). 
k= \k 


Example 2. Let v be a measure on Z, whose support lies in the open 
closed subset 1 + pZ,. Let y be a topological generator of 1 + pZ,, for 
instance y = 1 + p. There is an isomorphism 

Z,—>1 + pZ, 
such that 
Xr> y*, 


By pull back, there exists a unique measure p = yp, on Z, such that 
| u dy(u) = i y>* du;(x). 
14+PZp Zp 
By Theorem 1.2, writing ys = 1 + z, we get 


I ‘ uw dv(u) = f(ys — 1). 
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The power series f is not easily determined in terms of v. Iwasawa expressed 
his results on p-adic L-functions in terms of the power series f Mazur gave 
the formulation in terms of the integral, see §3 below. 


Theorem 1.3. (Mahler) A function @ from Z, into 0 is continuous if and only 
if there exist elements a, €0 such that |a,| > 0 and 


wor Salt 


The sequence {a,} is uniquely determined by 9. 


Proof. Given a sequence {a,} as above, it is clear that the function 


os) = > a,(") 


is continuous. For uniqueness, let 


Ag(x) = op + 1) — o(). 


Then g(0) = ao, and furthermore 


a(t) =(,2 i) M90) Zen) 


and 
A*p(0) = a. 


This proves uniqueness. 

We now prove existence. In the applications, the measures will take values 
in the ring of p-adic integers in a finite extension of Q,. An argument using 
tensor products reduces the general case to this case, and we omit it since we 
have no use for it. The case of a finite extension is then reduced to the case 
when the measure is Z,-valued by taking a basis for the ring of values over 
Z, and projecting on the coordinates. We now handle this case. 


Let B be the Banach space of sequences (a,) with a, € Z,, and |a,| — 0, 
under the sup norm. We have a Z,-linear map 


B--CZy,Z,) by (a)+> > an(*)- 


We have to show it is surjective. By completeness of C(Z,, Z,) it suffices to 
prove that a given fe C(Z,, Z,) is congruent to the image of an element 
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of B mod p” for each x, and by a simple recursion, it suffices to do this mod p. 
In other words, it suffices to prove that the map 


{(Gn), An € Fp, almost all a, = 0} > C(Zp, F,) 


given by the same formula as above, is surjective. But 
C(Z», Fy) = (_) Maps(Z(p"), Fy) 
N 


because F, is discrete and finite. 


Lemma. Let 0 < k < p%. Then the function 
x 
> d 
x ( ) mod p 


of Z, into F, is periodic of period p™. 
Proof. We have to show 


( ) 
k 


(1+ 7)” = (1 + TY + TP” = (bh + TH + T?”) mod p, 


(7) mod p ifk <p". 


we prove the lemma by comparing the coefficients of T*. 
Now we are reduced to showing that 
{(Qn), In € Fp, Qn = O ifn > p%} > Maps(Z(p"), F,) 
is bijective. Since both spaces have F,-dimension p%, the surjectivity follows 
from injectivity, which is proved the same way we proved that the function 


(x) has uniquely determined coefficients a,. This proves Mahler’s theorem. 


Corollary. If f(X) = > c,X" and 


(x) = Da,(*); 


then 


| ? dp; = > AnCn, 


I/ @ dp; 


sO 


< (supla, |) fl < IF. 


§2. Operations on Measures and Power Series 


We define the norms: 
IF = sup lea 


|| = sup |u,(x)| as before. 


Theorem 1.4. We have |f|| = ||;||. 


Proof. Since 
x 
Ch =| ( ) du,(x), 
n 


we get trivially ||f|| < ||u,||. Conversely, given a level p”, let x9 € Z(p") and 
let ~ be the locally constant function such that 


Q(X) = 1, and g(x) =0 if x 4 xo, xe Z(p"). 
Then 
| ea, = Un(Xo), 


and on the other hand, from the corollary of Theorem 1.3, we get 


| fod, 


< If, 


so ||u,|| < ||/|| as desired. 


§2. Operations on Measures and Power Series 


We shall give a list of integration formulas, or better, a list of operations on 
measures and their corresponding operations on power series. 


Meas 0. i Ai HO): 
Zp 
Proof. Special case of Theorem 1.2 with z = 0. 
For the next property, we let 
wAx) = (1 + z)* 
if ze m = maximal ideal of 0. Also (with formal groups in mind) we write 
X[4)z=X4+24+2X¥ =(14+2)0,4+ X)- 1. 
Meas 1. Wolly = Hy, where g(X) = f(X[+]zZ). 
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Proof. For w em we have 


[Ye and = | daha a 


=] (1+ wl + 2)* du,(x) 


Zp 


=| d+w+z+ wz) duj(x). 


Zp 
The property is then clear from the definitions. 


In particular, let € be a p"th root of unity, and let z = € — 1. Then 


Wx) = o* 


and we find: 


Meas 2. We-aby = Hg, 
where g(X) = f(CU + X) — 1) = f(X[+]€ — 1). 


As before, putting T = 1 + X, and f(X) = f¢,(T) if fis a rational func- 
tion, we can write the power series g(X) in Meas 2 in the form 


&(X) = fen(ST). 


Moreover, let @ be a step function, constant on cosets mod p". Write the 
Fourier expansion 


ax= > oO 


cna 
0 = > ven. 
Pp xEZ(pr) 
We find: 

Meas 3. Puy = Ug 
where g(X)= > @OfEU + X)—- LD. 

cena 
If f(X) = fe,(T) is a rational function, then 

BolT) = >, OOfeq(6T). 
gma 

Let U, = U be the operator 

UH) = FO ~ 5D ACU + XD D. 
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We call U the unitization operator because of the next property. 
Meas 4. If @ = characteristic function of Z*, then 


PL; = Mus: 


Proof. We compute trivially the Fourier expansion of 9: 
1 p-1 =“ I/p if ¢ # 1 
OO = 5 DOT eH! tea, 
P 
Then Meas 3 gives 
g(X) = > oOfEU + X)- 1) = U(X), 


(P=1 


as was to be shown. 


Remark. Let A be the formal multiplicative group (cf. Chapter 8). In the 
notation of such groups, we can write the unitization operator in the form 


UM(X) = f(X) - ; > AKU, 


Meas 5. Let x be a character on Zx*, of finite order with conductor N = 
power of p. Let € be a primitive Nth root of unity, and let 


S(O = pa x(aee. 
aeZ(N)* 
Then 
XU = Uy 
where 
(1) = 69D Hale“ +) - D. 


If f is a rational function, then 


go,(T) = “2D ¥ yalfeg(C-*T). 


Proof. It suffices to apply Meas 3 and to compute the Fourier transform 
of x. This is trivial, and we have 


> yone?” -{ if x = 0 mod p 
veZiny ¥@QSY,0) ifx=ax#Omodp. 
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Meas 6. XU (X) = Mp,(x) 


where D = (1 + X)Dx. In particular 


i 


x* d(x) = i) dhips, = D*f(0). 


‘Pp 


Proof. Note that 


x = fim LAF ATL jig VOT 
270 Zz 270 Zz 


Hence for any step function g we get 
; Ax) — 1 
J 009 cata) = tim | P22 =F (xy a) 


= fim | (%) dtg,(2) (by Meas 1) 


where 


J(X + 24+ 2X) — f(X) 


= = (1 + X)f'(X) mod z 


g(X) eae 


by Taylor’s formula. The desired result follows by taking the limit as z > 0 
and using the non-trivial part of Theorem 1.4, that is: 


Hoe — Hosll < lige — Df|| < |zI. 


Remark. We shall deal throughout with three variables. Let T be the 
variable on the “multiplicative group.” We put 


T = é’, X=T-1, T=1+¥X. 


Then Z is the corresponding variable on the additive group. For any power 
series f(X) (with coefficients in a field of characteristic 0) there is a corre- 
sponding power series denoted by f*(Z) or f,(Z) such that 


S(X) = fle? — 1) = fa Z) = feg(T)- 


This last equality makes sense only when f is a rational function. 
The differential operator D then can be expressed in terms of the three 
variables, 


ad + X)Dx = D; = TD,. 


The expression in terms of T applies only to rational functions of T (rational 
functions of X). The first two expressions in terms of X and Z apply to 
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arbitrary power series, and for any positive integer k, the expression D*f 
makes sense whether we view fas power series in X or Z. Furthermore, 


D'f(0) = Dzfe,(0). 
If fis a rational function, this is also equal to (TD,)*f,,,(1). 
Meas 7. Let g = Ug so yu, is a measure on Z¥. Then 
a~*y,(4) = Hun(@) 


where h is any power series such that Dh = g. 


Proof. Since a~11,(a) is a measure on Z*, there exists a power series f 
such that fe o[[X]], 


a~*yfa) = ua) and Uf=f. 
Then 
U(x) = xp,(x) 
whence by Meas 6, 
g = Df = DU. 


We let h = Uf to conclude the proof. 


§3. The Mellin Transform and p-adic L-function 
Let w be the Teichmuller character. If p is odd, then 


- 7 
@: Zy > Mp-1 


is the character such that w(a) = a mod p. If p = 2, then we define w(a) = +1 
such that 


o(a) = amod 4. 
Then we can write uniquely an element ae Z* as 
a = w(a)<a>, 
where <a> = 1 mod p if p is odd, and <a> = 1 mod 4 if p = 2. 


Let u be a measure. We define its Gamma transform as a function on Z, 
by the integral 


Fouts) = | <a>* duc), 


and we define its Mellin transform, also as function on Z,, by 


M,y(s) = if aya dua). 
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4. The p-adic L-function 


It is clear that ’,u and M,w are continuous in s. (For analyticity, see below.) 
Since the integral is taken on Z*, M,u depends only on the restriction of 
pe to Z3, so if w = yy, then 


Mp, = Mpbuy- 


If » = py, we write sometimes M,f instead of M,y,;, and similarly for the 
Gamma transform. 

Note that a7! du(a) for ae Z* is also the functional associated with a 
measure, so that the Mellin transform is actually a special case of the Gamma 
transform (of another measure). 


Theorem 3.1. Let g ¢o[[X]] be such that Ug = g, and let h be a power 
series such that Dh = g. Then Uhe o[[X]] and 


T,Uh = Mypy. 


Proof. This is an immediate application of Meas 7, after integrating the 
function <a>‘. 


We now consider the analyticity properties. 


Lemma. Let 1 be a measure on ZX. Then there exists a power scries 
he Z,[[s]], 


h(s) = >. bs” 
n=0 
such that b, > 0 as n—> ©, with the property that for all s< Z,, 
A(s) = i <a>’ du. 
Zp 
Proof. The integral can be written as a sum of integrals over cosets of 
1 + pZ, (or 1 + 4Z, if p = 2). Changing the measure appropriately with 


respect to each coset, we are reduced to proving (say for odd p) that for any 
measure yp, the integral 


| <a>? dy 
1+ PZp 


has the desired analyticity property. We note that 


Joan, Or | S (j)@- pr aca 


-| ; >, 36 - DAGeas ye s Hie: 
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But a — 1 = 0 mod pz, and so (a — 1)"/n! is p-integral for all n. Furthermore, 
(a — 1)"/n! tends to 0 p-adically as n> oo. Hence we can interchange the 
sum and integral to yield 


[<a du = > Palsre, 
1+PpZp n=0 
where P, is a polynomial of degree n with integral coefficients, and 
a-— 1) 
Ch = | ey du(a) 
1+PZp : 
is p-integral, and c, — 0. It is then clear that > P,(s)c, can be written as a 
power series h(s) whose coefficients b, tend to 0 as desired. 
We had the measure F,,, in Chapter 2, with ce Z%. Let s be a p-adic 


variable in Z,. For any c such that x(c)<c)* is not identically 1 we define the 
p-adic L-function L, by 


r,(1 —SO)= M,(xFi,S) 


a 
1 — x(c)<c>* 


= OS ie <ay*y(a)a~* dE, (a). 


By the lemma, the integral is analytic as a function of s. The factor in front 
is analytic except when 


x(e)<e)® = 


If y is non-trivial, we can select c such that y(c) € 1, and then the factor in 
front is also analytic at s = 0. 


Theorem 3.2. The value of L,(1 — s, y) is independent of the choice of c, 
and for any positive integer k, 


1 
L,(1 om k, 4) = ~k Br, yo-*: 
In particular, if k = 0 mod p — 1, and p is odd, then 


1 
L,(1 — kX) = 7 Bes 


Proof. Since the set of sufficiently large integers k = 0 mod p — 1 is dense 
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4. The p-adic L-function 


in Z,, we see that the first assertion follows from the explicit values given at 
integers of the form 1 — k as described. For these, we have: 


MLE: XK) = | Sart ty@ola)"? dE 0 
=[ a to(@)*4(@) aE) 
= (I~ xo""(cc¥) } Byyo-* by Theorem 2.4 of Chapter 2 
= (1 = (0K) f Berzo™ 


This proves the theorem. 
Theorem 3.3. Let g = Ug and let h be the power series such that 


Dh=g and h(0)=0. 
Then 


M,14,(0) = — ; DMG -D. 


Proof. By Meas 7 we have 


M140) = { a-* dyg(a) = [| duan(a) = UAC. 


The formula is then clear from the definition of U. 


To compute L,(1, y) we have to work out the power series associated to 
£,,, and then apply the formalism of the preceding section systematically 
to get the answer, with s = 0 in Z,(1 — s, x), using Theorem 3.3. 


Proposition 3.4. Let c ¢ Z*. The power series associated with the measure 
Ey, ¢ is 


1 c ; 
fie = PIT FELT? withT=14 X. 


Proof. It is immediate to verify that as power series in X the expression 
on the right-hand side is holomorphic at X = 0, and that its coefficients are 
p-integral because c is a p-unit. Let 


_ logT  clogT 
ID) = Fa FT 
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§3. The Mellin Transform and p-adic L-function 


Putting T = e7 we find 


f(1) = f"(Z) = za - we 


Zt 
=a - BF: 
On the other hand, let f,,. be the power series associated with E,,,, and write 


ad Zk-1 
fil) = ftdZ) = > tea Epi 


Since 


| xkn-1 dE, = lea ce c*), 
z k 


‘Pp 


it follows from Meas 6 that 
ex-1 = D¥-Yf8.(0) = 7 Bl - ©, 


so 


2frAZ) = D1 — cB, 2 = f*(Z) = 7 i fan 1 


It follows that 


as desired. 


Proposition 3.5. Let y be a non-trivial character on Z* with conductor N. 
The power series associated with yE,,, is 


Sx,c = GAT) _ cx(c)G (T°) 


where 


ofr) = GPS io map 


Proof. Immediate from Meas 5. 


We shall now assume that c is an integer >1 prime to p. 
Written in full, the power series for g,,, is 


AY ; 2 a ‘a 
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If we let 


where: 


A ranges over cth roots of unity # 1, 


a ranges over Z(N)*, 
then it is easy to see (and we carry out the computation below) that 
Dhy,c = &y,c and hy,(0) = 0. 
Furthermore, 
—Myg,,c(5) = (1 — x(e<e>)L,(1 — 5, x). 


The situation is then set up to apply Theorem 3.3. 
We now prove that Dh,,. = 8y,:. Observe that since > x(a) = 0, we have 


a(t) = HOS 4a = DS to te 


CAS) 
Lemma. Ne 


> Ko pap = OG LT) - GAT). 


A#l a 


Proof. Taking the logarithmic derivative of 


To — (% = [[qw- AC?) 


a 


we obtain 


: T  _ y X@cT° x(@)T 
DIO) F—7R = c Te — Cae ~ a a in 


A#l 
Multiplying by S(y, 4)/N proves the lemma. 
The assertion 
Dhy,c = 8, 
follows by using 


xX _T-1e 
a Os Sen BRET C 


and differentiating naively using D = TDr. 
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We shall recall below how it is possible to extend the definition of the 
p-adic logarithm uniquely to a continuous function on all of C} such that 
log p = 0. This is the log with which we deal in the next theorem, giving us 
Leopoldt’s value of the Z-function L,(s, x) at s = 1. 


Theorem 3.6. Let x be a primitive Dirichlet character with conductor N equal 
to a power of p. Then 


g(t, 1) = -SG59 Ha) log — £). 


Proof. By Theorem 3.3, Proposition 3.5, and the definition of L,(1, x), we 
find: 


(1 ~ 0)LaCt, 0 = 5 HOSS ¥ wo) toa(1 + Pr) 


a A#l1 


145 > x(a) log] | Ti == <F 


ee 


But 


g— x ee Sa 1 ager 
[i - laa — Tey tA ae 


Using the fact that N is the conductor of y and that the sum of a non-trivial 
character over a group is 0, we leave to the reader the verification that 


> X@ log(l — Ager) = 0. 
It follows that 


(1 = ODL s(t, ) = “B'S g@) log TT (1 - 28°) 
= 5G!) ¢) 2 x(a) ions = i 


= Sus $) (x(c) — 1) > x@ log — ©), 
as was to be shown. 


Appendix. The p-adic Logarithm 


We recall briefly how to extend the p-adic log to the multiplicative group 
C%. The p-adic log is defined first by the usual series 


2 
log,(I + x)= x — 4+ 
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4. The p-adic L-function 


We shall omit the p as subscript. The series converges for |x| < 1 (the 
absolute value is that on C,, the completion of the algebraic closure of Q,). 
We extend the log to all units of C* as follows. The units have a product 
decomposition 


U = ppp, x U; 


where pp) is the group of roots of unity in F of order prime to p, and U, is 
the group of units = 1 (mod p), and |p in C%. For each unit u we let <u> 
be its projection on U,, and we define 


log u = log <u. 


Thus the log has been extended to all units, and it is clear that this extension 
is continuous, and is a homomorphism. 

It is even possible to extend the log to the whole multiplicative group 
C* (following Iwasawa). We let P be a subgroup of C} containing the 
powers of p, and one ¢th root of p for each positive rational number ¢. Then 
the multiplicative group of C* has the product decomposition 


Px Up) x U;. 


Again we define the log of an element « € C¥ to be the log of its projection 
on Uj. 

We leave it as an exercise to the reader to verify that this extension is 
continuous. It is obviously a homomorphism. In particular, log p = 0. 

As for uniqueness, suppose the log has been extended to a continuous 
function on C*, which is a homomorphism into the additive group. Then the 
log has to vanish on all roots of unity. If log p = 0 then log p’ = 0 for all 
rational numbers r. Given a € C* there exists r such that ap’ is a unit. Hence 
the extension is determined by its values on units. Furthermore there exists 
a root of unity ¢ such that ap’t is =1 mod m, where m is the maximal ideal 
of the integers of C,. Hence the log is determined by its values on elements 
=I mod m, where it is defined by the usual power series. This proves 
uniqueness. 


§4. The p-adic Regulator 


Let K be a totally real number field, and let E = Ex be the group of units 
of K. Let p be a prime number. Let 14,..., u, be a family of independent 
units in K, and let 


o:K—>C,, i=1,...,r4+1 
be the embeddings of K in the p-adic complex numbers (completion of the 
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algebraic closure of Q,). We suppose o,,, = id. We define the p-adic regulator 
up to sign, 


R,(uy, ..-, Up) = + det log o,u,. 


If u,,..., u, are a basis for the units (mod roots of unity), we simply call it 
the p-adic regulator, and write 


R, os Rx,p 7 R,(Ex) am R,(E). 


If K is the real subfield of Q(u,,), and & is the group generated by the real 
cyclotomic units, then we let 


R,(é) a Riu, cat u,) 


where u,,..., u, generate these cyclotomic units, mod +1. 
We leave it to the reader to verify: 


Theorem 4.1. Let K be the real subfield of Q(um). Then 
R,@) = (E: €)R,(E) = (E: €)R,. 
We know from Theorem 5.1 of the preceding chapter that 
ht =(E:@). 
Let g, (a prime to m) be the cyclotomic units, and gi the corresponding 


real cyclotomic units. From our definition of the p-adic log, we know that 
for any embedding o: Q(un,) > C,, 


log og, = log og?. 


Thus in writing down the regulator, we can use the usual form for the 
cyclotomic units, without bothering to write down the extra root of unity. 

We may write the p-adic cyclotomic regulator by the Frobenius determinant 
formula, 


R,(@) = det, log cag = | | > x(@) logg., 


2X#1 aeG 


where G = Z(m)*/+1. Since 
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4. The p-adic L-function 


and since Daeg X(a) = 0, we may also write this formula in the form 


RAE) = [ | > x@ loge — 1). 


x#1aEeG 


The product is taken over all non-trivial characters of Z(m)*/ +1. 


Theorem 4.2 (Brumer). We have R, #0 for the real cyclotomic field 

QlUn)*. 

Proof. The cyclotomic units are algebraic, and it is a known theorem from 
the theory of transcendental numbers that the logs (p-adic or otherwise) of 
multiplicatively independent algebraic numbers are linearly independent over 
the algebraic numbers. The proof is the p-adic analogue of Baker’s proof for 
the corresponding result over the complex numbers, see Brumer [Br], or 
[L 4], before Chapter VIII, Introduction to the Baker method. The proof given 
there applies p-adically. The factorization of the regulator into a product of 
linear forms in logarithms then shows that the regulator is not 0. 


Theorem 4.3 (Leopoldt p-adic Class Number-regulator Formula). Let 
m = p” be a prime power, and K* = Q(um)*. Then 


ial Sra Y= eRe 
x#1 2 Vdx+ 


x even 


Proof. From Theorem 4.1 and the complexly derived index 
h* =(E:@) 


of Theorem 5.1 in the preceding chapter, we find: 


TI [1 z@ loge — 1) 


+h*R,(E) = + R,(€) 


2Z#1 aeG 
xeven 
my) 1 
= [[-~—454.0, 
I] 5G, D3 Lo O 


yeven 


by Theorem 3.6, the 4 appearing because G = Z(m)*/ +1, 


1 
= Vdx+ | | — 5 L(1, x) 
A#1 
Aeven 


by using the complex Theorem 3.1 of Chapter 3. Selecting the sign of the 
regulator R, appropriately yields the desired formula. 
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Remark. The proof of the formula involves the complex case. Presumably 
there is a direct proof, which is valid for all totally real number fields K. 
Cf. the Appendix of Coates Durham lectures [Co 3], where such a proof is 
given for the characteristic polynomial of a certain Iwasawa module. The 
extent to which analogues of cyclotomic units will ultimately play a role in 
such proofs is not clear at present. 


§5. The Formal Leopoldt Transform 


Let K be a field of characteristic 0. Let T be the variable on the multiplicative 
group. We put: 


T = é?, X=T-1, T=X+1, 
Then Z is the corresponding variable on the additive group. Note that 
X" = (T — 1)" = (e? — 1)”. 
Changing variables gives rise to the notation 
S(X) = fe? — 1) = fe (Z) = fe,(T). 


This last equality makes sense only when / is a rational function. 
For any power series fe K[[X]] we define the Leopoldt transform If as a 
function on integers >0 by 


fol Z) = > 1H) E- 
As before we let 
Dx = d/dX, D, = d/dZ, Dy = dfdT. 
Then 
Dz = (1 + X)Dx = TDy, 
and for any integer k > 0, 
I'f(k) = Difelz-0 = (TDr)'feqlr=1- 


Define coefficients y,(k) by 


( - 1" = > OZ. 


K=1 
Then 
Ynlk) = Die? = 1)"|z=0 = (TD,z)(T — 1)"|ra1 
yk) = 0 ifk <n. 
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4. The p-adic L-function 


Lemma. (a) We have 
n 7 n . 
mn = > (1) (Ce 


(b) Each integer y,(k) is divisible by n!. 
Proof. As to the first assertion, it is immediate by induction that 
(TD,)'T? = i*T', 


Hence 
yal) = (TD )(T = 1)"|par = (TD2)* (-o(7)r 
i=0 qt 


has the expression as stated. On the other hand by induction it follows that 
given an integer n, for each integer & there exist integers a,,..., @,—, such 
that a, = Oif i > 0, and 


(TD7)(T — 1)" = a,(T — 1)" + na,_\(T — 1)"-2 +--- 
+ n(n —1)---a—k+ Da,_.(T — 1)"-*. 


Putting T = 1 yields the second assertion. 


In the light of the lemma, a power series f(X) has the Z-expression 


1H) = > > a, WOZ 


n=0k=0 


Consequently, 


=F Faye BOZ 
n=0k=0 n! k! 


These formulas can be summarized in the following theorem. We let C(Z, K) 
denote the space of functions from Z into K. 


Theorem 5.1. There exists a unique linear map 
I: K[X]— C(Z, K) 
Satisfying any one of the following equivalent conditions: 
T1. (1 + X)")\(k) = m* for all integers m > 0. 
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12. r(77) (&) = 2, 
r3. faZ) = > 1H Z. 


This map also satisfies: 
T4, I(ZDzf k) = kIf(k). 


Observe that the Leopoldt transform is defined by 13 for power series. 
The other two conditions T1 and T'2 do not make sense for power series. 
However, in the next section, we shall work over a p-adic field K where these 
other conditions do make sense for a suitably restricted set of power series, 
with certain convergence conditions. 


§6. The p-adic Leopoldt Transform 


For simplicity, we suppose that pis an odd prime number. Let K be finite over 
Q,. Let C, = completion of the algebraic closure of K. We denote the p-adic 
absolute value by | | = | |,, normalized so that 


|p| = 1/p. 


We define the Leopoldt space: 
L = YL, = space of power series 


x” 
KX) => a, ar an eK, 
such that 
lim |a,| = 0. 
We define the Leopoldt norm 
Ife = max ay. 
Then is a Banach space, and a Banach algebra because || fg || v < || / || gllgllz. 
Theorem 6.1. If fe £ then f converges on the disc of elements 
xeEC, and |x| < |plve-». 


For such x we have 


If@)| < fle. 
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Proof. Obvious, because 


x” 
|p|"@-» < |n!| and so a, 5 S |a,|. 


We let C(Z,, K) = Banach space of continuous functions on Z, with 
values in K, and the sup norm. 


If ae Z, and pla, we let <a> = 0. 
If ae ZF we write 
a = (<a> = a(aXa> 


where € € p,_, and <a> = 1 mod p. The Teichmuller character w by definition 
is such that 


a(a) = ¢. 
If se Z,, then 
<ay>s = lim <a>* 
k7s 
is defined in the usual way, where k ranges over positive integers approaching 
s p-adically. If a is not prime to p, then we let <a>* = 0 for all s. 


If y is a character on Z*, as usual we put y(m) = 0 if m is divisible by p, 
so 


x(mXmy* = 0 if plm. 


If y = w* where a@ is a residue class mod p — 1, and k = « mod p — 1, then 
for any positive integer i such that p{i, we have 


i® = w(i)<i>*. 


Theorem 6.2. Let « be a residue class mod p — 1. There exists a unique 
continuous linear map 


sa Lx and C(Z,, K) 
satisfying any one of the following three equivalent conditions: 
ra 1. PA + X)"\(s) = w%(mm)>*, 


for any integer m > 0. 


r.2 r(2)o = 2 > coro 
Te 3. f(s) = lim Ff(em) = lim > ag (=) (m) 
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where the limit is taken over positive integers m satisfying: 
(*) m-—> ©, m-> Ss p-adically, m=amodp — 1. 


This map also satisfies 


ITfll < If lle. 


and for s € Zp, 


Yr. 4. P'((ZDz)f\s) = s0,f(s). 


Proof. Any continuous linear map on the space of polynomials (with 
Leopoldt norm) extends uniquely by continuity to the Leopoldt Banach 
algebra. We shall prove that the linear map 


T,: K[X]— C(Z,, K) 
with values 


r(2)o) = 5 > cr'(Jowar 


is continuous, and has the other properties. Uniqueness is obvious. 
If f(X) = > a,(X"/n!) lies in Y, then |a,|—> 0 as n— oo. To prove that 
I, is continuous, it will therefore suffice to prove that the values 


re(2r)) 


are bounded. In fact, we shall see that ’,(X"/n!)(s) is p-integral. This will also 
prove that 


IT.fll < lf lle. 


Fix the integer n. Let m range over integers as in I, 3. Such integers are 
dense in Z,, so it suffices to prove that 


I, (=) (m) is p-integral for such m. 


If i = 0 mod p, then i"/n! is p-integral for large m. If i # 0 mod p, and 
i= aff), 
then for m close to s p-adically, 


i™ = w(i)<i>" = w*({)<ID* ~—s mod high power of p. 
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The Lemma (b) of §2 then concludes the proof that |,f|| < ||fl|g, and the 
arguments also show that I, 1 and I, 3 are satisfied. It is clear that I’, also 
satisfies [.. 4, thereby concluding the proof of the theorem. 


We call I’) the p-adic Leopoldt transform. 

The Leopoldt transform p-adically with characters w* was first used by 
Lichtenbaum [Li 3] to deal with elliptic curves. 

In Theorem 6.5 below we shall prove that I, is the p-adic Gamma 


transform already mentioned in §3 when applied to a power series with 
coefficients in 0. Hence we may then write 


Tof = pf. 


We recall the operator 
1 
YX) =) - 5 a FOX +N -D, 


where the sum is taken over all pth roots of unity ¢. Then for the special 
polynomial (1 + X)" we have 


Ud + X)) = (1+ x)" -5 pas +x) 


n(y 15 e), 
=a + x91 52°) 
and 
JP ifp|n 
26 ={6 if ptn. 
Hence 


ul +X) ‘ if pln 
~ lat xy" if ptr, 
and in particular, U is a projection operator, i.e., 


U? = U. 


The next lemma describes the continuity property of the operator U for 
the Leopoldt norm. 
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Lemma. |Uflle < fle. 
Proof. 


> AMEX+6-Y)= yoke t= 
#1 A 

343 (ene - ne 
BEA Ge mer 


The coefficient of X*/k! in the above sum is either 0 or 


4 ! kyk 
Da oma 3y ee 


nm CFL 


and |¢ — 1|"-* < |(n — k)!|, so the coefficient of a, is not a unit at p. But 
> © — yee 
CF 

is a rational integer, and is therefore =0 mod p. Hence 


1 1 os XK 
1s pex+t-=15($ ama) 
rate C—D=5 2 (2 anbnx) Fy 
where b,,, € Z and b,,;,, = 0 mod p. It is then immediate that 
Use < fle. 
as desired. 


The next theorems prove for the Leopoldt transform on the Leopoldt 
space results which have already been proved for measures. 


Theorem 6.3. Let m be an integer >0, and m = a mod p — 1. Then 
I,f(m) = TUf(m). 
Proof. The two maps 
fieT,f and fieruf 


of K[X]— C(Z,, K) are equal on the polynomials (1 + X)”. For a fixed m 
the maps 


fteTf(m) and fr>ruf(m) 
are continuous, so the theorem follows by continuity. 
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Theorem 6.4. For fe Ly we have 


Pof(0) = USO) = FO) — ; Ps SE — 1). 


Proof. The power series for Uf in terms of X or Z have the same constant 
term. Hence 


FUf(0) = Uf). 


Taking « = 0, the theorem is obvious from Theorem 6.3, and the fact that 
¢ — 1 lies in the domain of convergence of f by Theorem 6.1. 


The next theorem resulted from a conversation with Ribet. 


Theorem 6.5. For s € Z, and fé o[[X]] we have 


[<a> dua) = Posts. 


Proof. By continuity in s, it suffices to prove the theorem when s = k is 
an integer > 1 and k = 0 mod p — 1. Let 9 be the characteristic function of 
Z*, Then 


Zp 


[carr dun = | x00) dao 


ii x* diy ,(x) 


D‘Uf(0) 
= TUf(k) 
= Tof(k). 


This proves the theorem. 


We now see that the Leopoldt transform is an extension of the Gamma 
transform to the Leopoldt space. 
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Iwasawa Theory and Ideal Class Groups 


We shall now study Iwasawa’s theory concerning projective limits in Z,- 
extensions. 

The first three sections establish purely algebraic facts about projective 
limits, and finitely generated modules over the power series ring Z,[[X]] 
which appears as the limit of p-adic group rings of cyclic groups. The situation 
is quite similar to modules over principal rings when considering finitely 
generated modules over integrally closed Noetherian domains. Cf. Bourbaki, 
Commutative Algebra, Chapter VII, §4, where a general structure theorem is 
given. For 2-dimensional local rings, this was complemented by Serre [Se 1] 
who showed that reflexive modules in that case are free, thus getting a com- 
plete result for Z,[[X]]. Here we shall follow Paul Cohen’s proof analogous 
to finding elementary divisors by row and column operations. 

Weshall also follow Serre’s exposition [Se 1], giving the asymptotic estimate 
for the orders of the factor modules. This is applied afterwards to the orders 
of ideal class groups. Iwasawa’s original proofs were rather complicated, and 
his point of view was that of projective limits of finite abelian p-groups on 
which I ~ Z, operates continuously, and which are of topologically finite 
type for this action. See [Iw 1], [Iw 6]. Serre [Se 1] saw that there was an 
isomorphism of categories between these objects and Z,[[X]]-modules of 
finite type. He introduced this point of view which simplified the proofs and 
also proved successful in subsequent applications. 

The next three sections deal with arithmetic situations arising as special 
cases (but which historically motivated the general results). We consider 
several modules over the Iwasawa algebra. First, we deal with the projective 
limit of ideal class groups. Class field theory identifies this projective limit 
with a Galois group. The reader unacquainted with class field theory can 
simply take for granted the isomorphism, which is described as we need it. 
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We follow mostly Serre’s exposition [Se 1]. The results are valid for arbitrary 
Z,-extensions, not necessarily cyclotomic ones. 

The final two sections go further into certain Galois groups as modules 
over the Iwasawa algebra, and also describe all possible Z,-extensions of a 
given number field in class field theoretic terms. The Leopoldt conjecture 
would imply that there are precisely r. + 1 independent ones. This depends 
on the Z,-rank of the closure of the global units in the local units. See §5, 
Theorem 5.2. 


§1. The Iwasawa Algebra 


Let I be a topological group isomorphic to Z,. We write [ multiplicatively, 
and let y be a fixed generator, so that the isomorphism may be written 


xt> yp” for x € Zp. 


lr, =T/0" ~ Zp"). 


Then I, is cyclic of order p", generated by the image of y. Conversely, a 

compatible system {y,} of generators in a projective system {I°,} of cyclic 

groups of order p” would give rise to a generator y in their projective limit. 
We have a commutative diagram 


Z[Pn+i1] > Z[T\(T"** — 1) 


| 


Z,1F 2] as Z,[T]/(T" _ 1) 


where T is a variable. Let ¥ = T-— 1, T= X + 1. Then Z,[T] = Z,[X], 
and 


Z[TW(T™" — 1) % ZLXVCX + 1" — 1). 
Let 
h, = h,(X) = (1 + X)" — 1. 
Then 
ho SX hos, 


and all coefficients other than the leading coefficient are divisible by p. Such 
a polynomial is called distinguished. 
We wish to establish an isomorphism 


Z,[[X]] => lim Z,[P,] = lim Z,[X1/(n). 
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Let 
A = Z,[[X]]. 


We first note that if A is any distinguished polynomial, then 
ZyLX VA) & ALhA. 


This is immediate from the Euclidean algorithm (see Theorem 3.1), which 
shows that A/hA is free of rank deg h over Z,, and similarly Z,[X]/(#) is free 
of the same rank over Z,. Furthermore this same algorithm shows that the 
natural map 


ZL X (A) > AAA 


is surjective, so is an isomorphism. 
We thus obtain a natural map for each n, 


Z,[[X]] = Zn] = Zp[X Vn); 
whence a homomorphism 
e:A = Z,[[X]] — lim Z,[X]/(A,). 


Theorem 1.1. The homomorphism « is an isomorphism. 


Proof. A trivial induction shows that 
h, = (1 + X)P" — le(p, X)**# 


where (p, X) denotes the maximal ideal of Z,[X], generated by p and X. 
It follows that the intersection of the ideals h,A must be 0, whence the kernel 
of ¢ is 0. Since ¢ is clearly surjective, this proves the theorem. 


Note that the isomorphism ¢ depends on the original choice of generator y. 
The projective limit 


lim Z,{09] 
< 


is called the Iwasawa algebra. Given a choice of generator y, it is identified 
with Z,[[X]] by Theorem 1.1, and then we also call Z,[[X]] the Iwasawa 
algebra. 

We now consider modules over the Iwasawa algebra. 

For each n let V, be a module over Z,[I’,], and suppose we have homo- 
morphisms 


Vasa = V;,, 
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compatible with the action of the group rings Z,[I°,,,] and Z,[I”,] respec- 
tively. We may form the projective limit 


V = lim V,, 


which is then a A-module. 


Examples. In §4 of this chapter, V, = C, is the p-primary part of the ideal 
class group, and so the projective limit C is a module over A = Z,[[X]]. In 
Chapter 7, we shall consider projective systems of local units as modules over 
the Iwasawa algebra. 


If each V, is a finite abelian group, or is compact, then the projective limit 
V is compact, and Z,[[X]] operates continuously on V, which is then what we 
call a topological module over Z,[[X]]. (Here and in the sequel, compact 
means compact Hausdorff.) Note that Z,[[X]] itself is compact. 


Nakayama’s lemma. Let 0 be a local ring with maximal ideal m, and m-adic 
topology. Let V be a compact topological v-module. 


(i) IfmV = V then V = 0. 
(ii) If 0 is compact, and V/mV is finitely generated, then V is finitely 
generated by any set of representatives of V/mV. 


Proof. Let U be a neighborhood of 0 in V. Since V is a topological o- 
module, for each x € V there exists an open neighborhood U,, of x and a 
positive integer n(x) such that 


mo, < U. 


A finite number of neighborhoods U,, cover V. Hence there exists an integer 
n such that m*"V ¢ U. But mV = V implies m"V = V, and hence V < U for 
all U. Since V is Hausdorff, it follows that V = 0, which proves (i). 

For (ii), let x,,...,x; be representatives of V/mV, and let W be the 
o-submodule generated by them. Then W is a continuous image of 0, and 
is therefore compact, and closed. Then V/W is compact, and we have 


m(V/iW) = V/W. 
Hence V/W = 0, and V = W, thereby proving (ii). 


Next we pass to certain results concerning finitely generated modules over 
the Iwasawa algebra. These will be applied to computing orders of certain 
factor groups (which in §4 will be ideal class groups). The reader may omit 
the rest of this section if he wishes to disregard such computations for the 
moment and merely wishes to concentrate on general structural results. 

Two modules V, V’ are said to be quasi-isomorphic if there is a homo- 
morphism 

V>v' 
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with finite kernel and cokernel. It will be shown in §3 that any finitely 
generated V has a quasi-isomorphism with a finite product 


(*) V>A? OT [A @| [4/), 


where the f; are distinguished. The first factor A™ is the free part. The other 
factors are A-torsion modules. 

Suppose now that V is a torsion module such that V/h,V is finite for all 
n. We wish to get an asymptotic formula for the order of V/h,V. Such a 
formula does not change under a quasi-isomorphism, so we are reduced to 
consider the two cases when 


V=A/p" and V=A/f 


for some positive integer m, and fis distinguished. 
In the first case we have 


Alp™ = Z(p™)[[X]], 


the power series ring over Z(p”). In the second case, A/f is a free module 
over Z,, whose rank is deg f. It may happen in this second case that 


V,, = Vi?" a IV 


is not finite. We shall first make the assumption of finiteness to get the 
formula for the order, which is a power of p, so we put 


card V, = p*. where e, = e,(V). 


Theorem 1.2. (i) Jf V = A/p™ then e, = mp”. 


(ii) Let V = Aff where f is distinguished of degree d, and assume J, finite 
for all n. Then there exists a constant co such that for all n sufficiently 
large, 


e, = dn+ Co. 


(iii) If V is finitely generated over A such that V,, is finite for all n, then 
there exists a constant c such that 


e(V) = mp" + dn+ec 


for all n sufficiently large. In the representation of V as in (*) with 
r = 0, we have 


m= >m and d= > deg fF. 
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Proof. In case (i) we have 
Z(p")ULXIWCX + I" — 1) © ZE™XVC(X + Y* - db, 


and this is just Z(p")[T]/(T”" — 1), which is a free module of rank p* over 
Z(p”). Thus the computation of the order is obvious. 
Let us now look at case (ii). For any EA we let hy be the endomorphism 
of V induced by h. We let 
m=, w= kt. 
We have 
yr — 1 = (X¥ + I" — 1 = X" (mod p), 
f= X*% (mod p). 


Hence there exists 1) such that for n > ny we have 
X*"-?=Omod (fp) and Xf" = O0moda, 


and therefore 


pn-1 


Ya-1 = YF = 1 (mod p). 


It follows that 
Yn = Yr-1 = 1 (mod p?). 
Now 
Yer —~L= (ton tet en — YD 
=(1+1+4---+ 1+ O(p%))Q, - 1) 
= pu(y, — 1) 


where uw is invertible. We have therefore shown that 


(vn — IV = p®- "yn, — IV. 


Furthermore, (y,, — 1)V is of finite index in V, and is therefore a free 
module over Z, of the same rank d as V. This proves (ii). Case (iii) is then 
obvious, thus proving the theorem. 


Next we consider the case when V/h,,V is not necessarily finite, but make 
additional hypotheses which still allow us to compute the orders of certain 
factor groups asymptotically, and which are satisfied in the application to 
ideal class groups. 

We let 


Sn = Leyte tyr 
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We say that V is of Iwasawa type if there exist elements v,,..., v,€ V such 
that, if we put 


U, = Z,-submodule of V generated by (y — 1)V and y,,..., v,, 
U, = 8nUd, 


then V/U,, is finite for all n. In particular, V/U, is finite. For a module of 
Iwasawa type, we let 


V, = V/U,. 


Theorem 1.3. Assume that V is of Iwasawa type. Then the conclusions of 
Theorem 1.2(i), (ii), (iii) remain valid. 


Proof. Note that Case (i) is unchanged, only Case (ii) is now slightly 
different, but the proof runs along entirely similar lines as follows. In this 
case, V is Z,-free of rank d. An argument similar to that of Theorem 1.2(ii) 
shows that 


8nV = pr "Bn V 
for all n > no. Let W = g,,V. Then 


e(VignV) = e(V/W) + e(W/p*-"0W) = c, + d(n — no) 


for some constant c,, since W has the same Z,-rank as V. This proves the 
theorem since e(g,,V/g, Uy) < e(V/Up) and stabilizes for large n. 


§2. Weierstrass Preparation Theorem 


The proof of the Weierstrass theorem in this section is due to Manin [Man 1]. 
We start with the Euclidean algorithm. 


Theorem 2.1. Let 0 be a complete local ring with maximal ideal m. Let 
f(X) = > aX! 
1=0 


be a power series in o[[X]], such that not all a, lie inm. Say ao, .. ., An-1 € M, 
and a, € 0* is a unit. Given g € o[[X]] we can solve the equation uniquely 
g=aqftr, 
with q € o[[X]], re o[X], and degr <n — 1. 
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Proof. Let « and t be the projections on the beginning and tail end of the 
series, given by 


n-1 

a: > b Xt > b,X' = bo + by treet b,-1X"71 
i=0 

£2 DB X'S DS BX = By + Baer X + Bug X? Hoo. 
t=n 


Note that 1(X"h) = h for any h € o[[X]], and A is a polynomial of degree <n 
if and only if (2) = 0. 

The existence of g,r is equivalent with the condition that there exists q 
such that 


(g) = c(4f). 
But 
f= of + X*(f). 


Hence our problem is equivalent with solving 


a(g) = t(qa(f)) + t(gX"e(f)) = c(ga(f)) + arf). 


Note that t(f) is invertible. Put Z = qt(f). Then the above equation is 
equivalent with 


1(g) = “( ah) +Z= (1+ eo aD\z. 


Note that 


2s wp: o[[X]] > mo[[X]], 


because a(f)/t(f) € mo[[X]]. We can therefore invert to find Z, namely 


af, ) = 
Z={I+T° t(g), 
(1+ 93H) 
which proves both existence and uniqueness and concludes the proof. 


Theorem 2.2 (Weierstrass Preparation). The power series f in the previous 
theorem can be written in the form 


S(X) = (x" + Bye xX * era bo)u, 
where b,€ m, and u is a unit in o[[X]]. 
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Proof. Write 
X"*=qft+r, 


by the Euclidean algorithm. Then q is invertible because 


q=C + aX+-:- 
f=- t+ aX" 4+--- 


so that 
1 = coa, (mod m), 
and co is a unit in o. We obtain gf = X" — r, and 
f= q(x" — 1), 
with r = 0 (mod m). This proves the theorem. 


The integer 1 in Theorems 2.1 and 2.2 is called the Weierstrass degree of f, 
and is denoted by 


degy f- 


We see that a power series not all of whose coefficients lie in m can be 
expressed as a product of a polynomial having the given Weierstrass degree, 
times a unit in the power series ring. Furthermore, all the coefficients of the 
polynomial except the leading one lie in the maximal ideal. Such a polynomial 
is called distinguished. 


§3. Modules over Z,[[X]] 


The structure of finitely generated modules over Z,[[X]] was first determined 
by Serre [Se 1] who introduced this point of view in Iwasawa theory. As 
already mentioned, cf. Bourbaki for general structure theorems over integrally 
closed Noetherian domains. Paul Cohen showed how one could give a proof 
along the standard lines of row and column operations, cf. [L 3]. Robert 
Coleman pointed out to me that the inductive step as given in [L 3] had to be 
modified, and I am indebted to him for the exposition given in the lemma and 
Theorem 3.2 below. 

We let A = v[[X]], where 0 is a complete discrete valuation ring. We 
denote by p a prime element of o. By a finite module over » we mean a 
finitely generated module annihilated by some power p* and some distin- 
guished element 4. If 0 = Z,, then “‘finite” has the usual meaning. 

By a quasi-isomorphism we mean a homomorphism with finite kernel and 
cokernel. We denote a quasi-isomorphism by the sign 


Mw~ M’. 
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Theorem 3.1. Let M be a finitely generated A-module. There exists a quasi- 
isomorphism 


M~ AO] [Ap" OT [4/4 


where each f; is a distinguished polynomial, irreducible in o{ X], i, j range 
over finite sets of indices, and A is the product of A taken r times, for some 
integer r. 


The rest of this section is devoted to the proof. 


Suppose that M has generators u,,..., u,. Relative to such generators we 
can form the matrix of relations, whose rows are vectors 
(Ai, erect An) 
such that 


Ayu, t-+++ Au, = 0. 


Since A is Noetherian, a finite number of the rows generate all of them. 
Performing the usual row and column operations on the matrix amounts 
to changing the generators of the module. We shall describe other operations, 
corresponding to embedding the module in a bigger one with finite cokernel. 
An element A €A is called p-free if 2 does not lie in pA, in other words, if 
we can apply the Weierstrass preparation theorem to it. 
Suppose that there is a relation of the form 


Ayu, + p(Agle +++++ Anta) = 0, 


where A, is p-free. We can form the new module M’ obtained by adjoining a 
new generator v with the relations 


pv = uy, Ay = — (Agus SB ee AnUn): 
This can be formalized by considering a direct sum 
MO(v) 


modulo the desired relations, i.e., modulo the submodule generated by the 
elements 


(0, pv) ~~ (1, 0) and (0, Av) a (Ague ein Antns 0). 


It is then immediately verified that the canonical map of M into the factor 
module is injective. The factor module M‘'/M is annihilated by p and 4,, 
whence is finite. Furthermore, the elements v, w2,..., u, generate M’, and 
have the relation 


Ayo + Agua +++ + Agu, = 0. 
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In terms of the relation matrix, this means that we shall allow the following 
operations, replacing the matrix R by a matrix R’. 


O 1. If R contains a row (Aj, pio, ..., pan) with A, not divisible by p, then 
we let R’ be the matrix whose rows consist of 


(A, ae ietelg An) 
and the rows of R with first element multiplied by p. 


Observe that in this first operation, we may have 1, =---= A, = 0. 
Next suppose that some power p* (k > 1) divides all elements of R, but 
that there exists one relation 


P* (Ax, - «+s An) 


such that A, is distinguished (or equivalently, 1, is not divisible by p). We may 
then form the module M’ obtained by adjoining a new element v with the 
relations 


pYv = p*u, and Ay = —(Aguée +--+ + Aqlty). 


Again, it is easily verified that M is embedded in M’ and that M’/M is finite. 
Note that p*(v — u,) = 0. The relations of the submodule 


(V, Ue, .. +5 Un) 
are generated by R and the additional relation 
(Ay, - +> An)- 
We have a direct sum decomposition 
M’' = (v, ua, ..-, Un) @ (v — uy), 


and the relations of v — u, are generated by p*. To prove the theorem, it 
suffices to consider the first component of M’. Thus our second operation is 
described as follows. 


O 2. If all elements of the first column in R are divisible by p*, and if there 


exists one relation (p*d,,..., p*A,) such that A, is not divisible by p, then 
we let R' consist of R and the new row 


(Ai, «+5 An). 
Finally we allow one more operation: 
O 3. If R has a relation of the form 
BY(Ax,..-54n), Kk = 0, 
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and there exists an element 4 not divisible by p such that 
(AAy, . «5 AAn) 


is also a relation, then we may replace R by the matrix R' having the same 
rows as R, except that the row p*(A,, ..., An) is replaced by 


(Aa, -- +5 An). 
This operation corresponds to the surjection with finite kernel 
M— M/(Ayu, +--+ + Anta). 


Row or column operations, or O1, O 2, O3 will be called admissible 
operations. 
Given a matrix R over A, we define 


deg™(R) = min degy(a,) for i, j = k, 


where (aj,) ranges over all admissible transformations of R which leave 
unaltered the components of the first k — 1 rows. 


Remark. If R’ is obtained from R by admissible operations leaving the 
values in the first k — 1 rows unaltered, then 


deg™(R) < deg™(R’). 


Let r => 1 be an integer. Suppose that R has the form 


0 Ap-1,1-1 0 eed 0 
* * * * 


and 
re 
(7 0 T) 0 
Assume also that 4, for i = 1,..., 7 — 1 is a distinguished polynomial with 
the property that 
deg(R) = dega,, fork =1,...,r—1. 


Then we shall say that Ris in(r — 1)-normal form. Ifr = 1 then this condition 
is vacuously satisfied, and is the starting point for the induction of the 
following lemma. 
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Lemma. Suppose that R is in (r — 1)-normal form, with its first r — 1 
diagonal elements Ay, ..., A¢—1,.-1- Then by admissible transformations, we 
can transform R into a matrix which is in r-normal form, and has the same 
first r — 1 diagonal elements. 


Proof. Using O 1 with respect to each of the first r — 1 rows, we may 
assume without loss of generality that any given power p* (k > 0) divides 
all components 4,, with i > r and j = 1,...,7r — 1, that is all components 
lying below the portion of the matrix which has already been diagonalized. 
Using O 2, we may then arrange that p does not divide some 1,, with i > r, 
andj > r. After a succession of admissible transformations on the lower right 


matrix 
Ais Berta 
SRR A RR RE 
induced by admissible transformations of R which leave the first r — 1 rows 


fixed elementwise, we may then find some element 4,, with i> randj>r 
such that 


degy 4, = deg(R). 


The Weierstrass preparation theorem allows us to assume that this element 
A is a distinguished polynomial, and 


deg Ay = deg(R). 


Finally, row and column interchanges which do not involve the elements J,; 
(i = 1,..., 7 — 1) allow us to assume that 4, = 4,,. 

There remains to show that we can make all other elements on the rth 
row equal to 0 after appropriate transformations. By the Euclidean Algorithm, 
we may assume that 


deg A,; < deg d/,, forr #4 / 
deg 4,; < deg A,, for j < r. 


We first deal with the elements to the right of 4,, on the rth row. We may 
assume that 4,; with j > r is divisible by p, otherwise we contradict the 
minimality of the degree of 4,,. Using O 1 repeatedly as before with respect 
to the first r — 1 rows, we may then assume that all elements 1,, with j < r 
are divisible by a high power p*. We then use O 1 with respect to the rth row, 
to divide all elements /,; (j # r) by successive powers of p, thus leading to 
some element 4,; with j’ > r not divisible by p, a contradiction of deg A,, = 
deg(R). Thus 4,; = 0 for j > r. 

For elements 4,,; to the left of ,,, that is with j < r, if some such element 
is not 0, then we may use O 1 with respect to the rth row to divide by p, 
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until we are in the situation where there exists j < r such that 4,, is not 
divisible by p, contradicting the facts that 


deg i,; < deg4,,; and degd,, = deg”(R). 
Thus we have put the matrix in r-normal form, and proved the lemma. 


Theorem 3.2. Jf R is a matrix of relations, we can transform R with a finite 
number of admissible operations into a matrix R' of the form 


where 1, are distinguished polynomials. 


Proof. By the lemma, we can replace R by a matrix R’ of the form 
Ay, 0---0 0---0 

OO ee ero ee 0 

* Kee k 0.--0 


where A,, are distinguished polynomials, and 
deg Ay = deg®(R) fori = 1,...,r. 
By the Euclidean Algorithm, we may assume that 2, = 0 or 
deg A, < deg A, for j # i. 


In fact, we contend that A,, = 0 for all j 4 i. Suppose otherwise, so that 
A, # 0 for some j > r > i, so we have a relation 


Aji, - + +> Ayr, 0,..., 0) 
not identically 0. Let 
A= Ayr: + Age 
Then / is not divisible by p, and Au, = 0 fori = 1,..., 7, so 
(Aaj, .- +5 AAs, 0, ..., 0) 


is also a relation. By O 3 we may assume without loss of generality that some 
Aji) +++» Agr is not divisible by p, and then contradict the minimality condition 
on the A,,. This proves the theorem. 
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We return to the module interpretation, to see that Theorem 3.2 implies 
the theorem. Indeed, any module with matrix of relations R’ as in Theorem 
3.2 is isomorphic to 


A ® ) Al(Ay). 


Finally, if £ g are distinguished and relatively prime, the map 


A(fg) > Alf © Alg 


is an embedding with finite cokernel. This allows us to decompose the factors 
A/dA,4 into a direct sum of factors 


A/(f;") 


where f; is distinguished and irreducible, thereby concluding the proof of 
Theorem 3.1. 


§4. Z,-extensions and Ideal Class Groups 


Let Ky be a number field. An extension K., of Ko is called a Z,-extension if 
it is abelian, and its Galois group is isomorphic to Z,. To give such an 
extension is the same as to give a tower of fields 


Ke = |) Ki 2-1? Ky > ++? Ko 
n=0 


such that K,, is cyclic over Ky of degree p”. 
Examples. Let p be a prime number. Let 


K, = Q(u,"+1) if p is odd 
K, = Q(up"+2) if p is even. 


This gives the cyclotomic Z,-extension over the field Ko. 
More generally, let K be any number field, let 


K® = K(u™) 


be the extension obtained by adjoining all p-power roots of unity. Then K™ 
is abelian over K, and it is easy to see that the fixed field of the torsion sub- 
group of Gal(K™/K) is a Z,-extension of Ky = K, called the cyclotomic 
Z,-extension. We study it later in the book. Note that a non-totally real field 
K always has non-cyclotomic Z,-extensions, cf. §5. Natural examples can be 
constructed with elliptic curves having complex multiplication, cf. [C—W]. 
We say that a prime ideal », of Ko is almost totally ramified in a Galois 
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extension K’ if the inertia group of a prime p in K’ over pg is of finite index 
in Gal(K’/Ko). We say ) is almost unramified if its inertia group is finite. 
We consider the following condition of Iwasawa. 


IW. K., is totally ramified over Ky over a finite number of prime ideals 
p,,..., ps Lying above p, and is unramified over all other prime ideals. 


Lemma. Let K../Ko be a Z,-extension. Then: 


(i) Only a finite number of prime ideals of Ky ramify in K., they lie above 
p, and they are almost totally ramified. 

(ii) For some positive integer d, the extension K/Kz is a Z,-extension 
satisfying IW. 


Proof. Some prime ideal » of Ky must ramify in K., because class field 
theory says the maximal unramified abelian extension of Ko is finite. Let J 
be the inertia group. It is a closed subgroup of I’, and 40, hence equal to 
p"Z, for some m, so that p is almost totally ramified. Over the completion 
Ko,p, the maximal tamely ramified abelian extension is finite. Hence the wild 
ramification group is of finite index in I’, thus showing that p lies above p. 
This proves (i). If we let »,,..., », be the finite number of primes which are 
almost totally ramified: and let 


L, z p%Z» 


be the inertia groups, and d = maxd,, then K.,/K, satisfies condition IW 
as desired. 


Assume that condition IW is satisfied. 

The same lemma as in Chapter 3, §4 shows that the norm map between 
any two successive steps in the tower is surjective on the ideal class groups. 
We let C, = C® be the p-primary part of the ideal class group in K,. Then 
we have a surjective sequence 


CO<Ci<C.<::: 


C = lim proj C, 


be the projective limit. We may view C as consisting of all sequences 


and we let 


(co, C1, Ca,-- .) 


with c, € C, and c,,, mapping on c, under the norm map. 
Let M,, be the maximal p-primary abelian unramified extension of K,, in 
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other words the p-primary part of the Hilbert class field of K,. There is an 
isomorphism given by class field theory 


C, ~ Gal(M,/K,) 
such that the following diagram is commutative. 
Casi > Gal(My41/Kn+1) 
worn [recon 
Cy, ——> Gal(M,/K,) 


Since K., is totally ramified over K,, it follows that M, is linearly disjoint 
from K.. over K,. The lattice of fields looks as follows. We let Ma. = U My. 


We let 
G = Gal(M../Ko) and Go = Gal(M../K.) = C. 


Remark. If we replace Ky by K, then K., over K, satisfies the same con- 
dition IW, so a number of results proved for K.. over Ky apply a fortiori to 
Ka over K,. Observe that if y is a topological generator for I, then 


Gal(K../K,) = F?" = {y?"} & p"Zp. 
Theorem 4.1. Assume first that YW is satisfied with s = 1. Let I be the 
inertia group of any prime above » in G. Then: 


(i) G = IG¢ is a semidirect product, and the restriction of I to K~ gives 
an isomorphism of I and I. 
(ii) The commutator group G’ = Gi}. 
(iii) We have isomorphisms 


C/C’"* & Co © Gal(Mo/Ko) © Gal(KaMo/Ka) © Go/Ge7?. 
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Proof. We have an exact sequence 
1>+Gpe>Gor->l. 


The image of Jin I by restriction to K,., is surjective because K,, is totally 
ramified over Ko. It is injective because M,, is unramified over K,, and so 


INGe = {1 


This proves (i). If ¢ € Gg then o’~1 is a commutator because I operates on 
Ge by conjugation. Hence G2! < G’. On the other hand, G/G%~? is abelian, 
so the reverse inclusion also holds and (ii) is proved. Finally, M> is the 
maximal p-primary abelian unramified extension of Ky and so Gal(M./Mo) 
is the smallest subgroup of G containing G’ and the inertia group J. Since G 
is the semidirect product of J and Gg, we see that (iii) follows from (ii), and 
conclude the proof of the theorem. 


Corollary. We have an isomorphism 


C, = Gal(M,/K,) © C/C’"-1 = G/GP"-}. 
Proof. Apply the theorem to the situation where Ko is replaced by K,. 


Next consider the general situation with a finite number of primes. 


Theorem 4.2. Assume that IW is satisfied, with primes ),,..., ~;. Let I; be 
the inertia group of »; in G. Then: 


(i) There is a semidirect product decomposition 
G —_ 1,Ge, 


and G’ = Gtr}. 
ii) Let o, be a generator for I,, and write 
j j 


0; = TO, with T; E Ge. 
Then 
Co ~ Gel(t1, ssey Ts, oe =): 


Proof. Identical with that of Theorem 4.1, except that in the present more 
general situation, we have to look at the smallest subgroup of G containing 
the commutator group G’ and all the inertia groups J, instead of a single 
inertia group J. 


Corollary. Let Uy be the Z,-submodule of Gc generated by the elements 
1 ,..., 7, and G1. Let 


U, = Ug» whereg, =1L+y+ yp? +--+ + yprrn}. 
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Then 
C, % Ge/Up. 


Proof. We apply the theorem to K.. as Z,-extension of K,. This has the 
effect of replacing y by y?” and o; by o;,”". Then 1, is replaced by (t,)%, because 
for every positive integer k, we have 


of = (t,0,)* = 1,017,601) -oft;0, 7: - of *t,07 **}-of, 
whence for k = p" we obtain 
af” = (t,)%- of". 
Then 


U, = GP" 4a, tees Ts)%n 
where (t,,..., T;) is the group generated over Z, by 7,,..., t,. Since 


&(y — 1) =" — 1, 
we find U,, = U§=, which proves the corollary. 


It will be easily shown below in Theorem 4.4 that C is finitely generated 
over the Iwasawa algebra. Then Theorem 4.2 and its corollary show that C 
is of Iwasawa type as defined in §1, so that one can apply the counting 
procedure given there, to get an asymptotic formula for the orders of the 
groups C,. 

Iwasawa has conjectured that m = 0 in the case of the cyclotomic tower 
Q(u,*), So that in this case, the order of the ideal class group (p-primary part) 
would have the form 


Card C, = p%*°¢ 


for n sufficiently large, in analogy with the orders of points of p-power order 
on abelian varieties. This conjecture has recently been proved by Ferrero and 
Washington. On the other hand, he has given examples of non-cyclotomic 
Z,-extensions K,, over Ky for which m > 0. 


Theorem 4.3. Assume that IW is satisfied with one prime. If Cy = {1}, then 
C, = {1} for all n. 


Proof. If C, = 1, then Theorem 4.1 shows that C = C’-1. Viewing C as 
module over Z,[[X]], this means that C = XC. But X is contained in the 
maximal ideal of Z,[[X]]. By Nakayama’s lemma, it follows that C = {1}, 
as desired. 
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Remark. We could let K,, = Q(,.+:)" be the real subfield of the cyclo- 
tomic field. It is a conjecture of Kummer-—Vandiver in that case that Cy 
is trivial. [Remember: By definition, Cy = CY) in this chapter.] Thus the 
Kummer-—Vandiver conjecture may also be formulated by saying that 


hg is prime to p, 


and Theorem 4.3 shows that if this is the case, then h7 is also prime to p 
for all n. 

For historical comments on the Kummer—Vandiver conjecture, see the 
Introduction. Kummer usually denoted h* by D/A, where D and 4 de- 
note the regulators of the group of units and cyclotomic units respec- 
tively, so their quotient is the index equal to h* by Theorem 5.1 of 
Chapter 3. Cf. also the discussion in [L 5]. 


Theorem 4.4. For any Z,-extension the module C over Z,[[X]] is a finitely 
generated torsion module. 


Proof. Suppose first for simplicity that condition IW is satisfied with only 
one prime. Then C/mC is a factor group of C/C’~?, which is none other than 
C, by Theorem 4.1, and is therefore finite. That C is finitely generated is a 
special case of Nakayama’s lemma. 

In general, when IW is satisfied but with several primes, then we have to 
use another argument. By Theorem 4.2 we know that 


Go/Gi-* x C/C*} 


is finitely generated over Z, of rank uniformly bounded by s. Nakayama’s 
lemma again shows that C is finitely generated over the Iwasawa algebra. 
Furthermore, applying Theorem 4.2 to K./K,, that is replacing y by )?", 
shows that 


Ch(y* — NC 
is also finitely generated over Z, with a similar bound s for the rank. By the 


structure theorem of §3, if V is finitely generated over A, then there is a quasi- 
isomorphism 


(*) V>A” OT [A/e™ OT [ A/V), 


where f; are distinguished. We use this with V = C, writing V additively. 
The uniform bound on the rank immediately shows that there cannot be any 
free part, i.e., r = 0. This proves Theorem 4.4. 
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§5. The Maximal p-abelian p-ramified Extension 


The next two sections describe in class-field theoretic terms some properties 
of the Galois group of the maximal p-abelian p-ramified extension of a number 
field, and describe its Z,-extensions. 

Let K be a number field. We let: 


M,(K) = the maximal p-abelian p-ramified extension of K. 


M3*(K) = the maximal p-abelian unramified extension of K. 


We fix the prime number p and the field K, so we sometimes omit reference 
to them in the notation. 


J = Jx = ideles of K, and U is the group of unit ideles, 


U=[[U, and Je =[] Ke. 
p 


VES wo 


In the first product, » ranges over the prime ideals of K. We write 


U,=[[Up, and Up, =[ | U. 


pip ep 
E = Ex = units in K. We have an embedding on the diagonal: 
0,: E— U,. 
G3°(K) = Gal(M,(K)/K). 
By class field theory, an abelian extension of K is unramified at primes 


dividing / if and only if its associated group in the ideles contains U,. Conse- 
quently we have an isomorphism 


G3°(K) & p-part of Jg/Ujp)J °K* 


where the bar denotes closure in the idele topology. We have the inclusions 
J > U,U pI? K* > Up J”? K*. 
The first factor group 
J/U,U pI? K* = J/UK* 


is isomorphic to the Galois group of the Hilbert class field, and is finite. 
The second factor group is equal to 


U,U p)J °K*/Uip J °K* &% U,/U, 0 Upp J? K*. 
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Lemma. U, 0 Ups °K* = 0, £- 


Proof. Let U{” be the group of units in U, which are =1 mod p”. Then 
the groups 


UU J °K* 


form a fundamental system of neighborhoods for U,»)J”K*, and their inter- 
section is this closure. Intersecting with U, (whose elements have component 
1 at all primes not dividing p) shows that 


UP Up) J °K* OU, = EUS. 
Taking the intersection for all proves the lemma. 


Theorem 5.1. Let H be the p-Hilbert class field of K. Then we have an 
isomorphism 


Gal(M,(K)/H) © p-part of U,/opE 
= US(US 2 o,E). 


Again, as p is fixed, we write simply U,/E. By a quasi-isomorphism, we 
shall mean a homomorphism with finite kernel and cokernel. We denote a 
quasi-isomorphism by a single ~. The theorem yields a quasi-isomorphism 


G3"(K) ~ U,/E. 


Furthermore, since U, contains an open subgroup of finite index isomorphic 
to Zi* :®, by means of the exponential map, say, we have a quasi-isomorphism 


G3*(K) ~ ZK :@-", where r, = rankz,E = r,(E). 


The Leopoldt conjecture states that r, = r =r, + r2 — 1. 
Let Z,(K) = composite of all Z,-extensions of K. From the quasi- 
isomorphism we find: 


[M,(K) : Z,(K)] < ©. 


Theorem 5.2. Assume the Leopoldt conjecture for K. Then we have a quasi- 
isomorphism 


G3(K) ~ ZP** & Gal(Z,(K)/K). 
Proof. The first statement comes from the definitions and 
[K: Q] = ry + 2re. 
For the second statement, we note that the composite of all Z,-extensions of 


144 


§6. The Galois Group as Module over the Iwasawa Algebra 


K has a Galois group embedded in the product of Z, with itself, and as such 
is a torsion free finitely generated module over Z,, whose rank is exactly 
r2 + 1 by the first statement. 


Example. Let K.. = Q(u™). Let U, be the local units in the completion 
of K,, congruent to 1 mod p,. Let U;, be the subgroup of units whose norm 
to Q, is 1. Assume the Kummer—Vandiver conjecture. Let the notation 
be as in §4 of the next chapter. Then we obtain an isomorphism 


Gal(Q/Q") ~ lim U%/E,. 
<— 


Without assuming the Kummer—Vandiver conjecture, we shall study the 
projective limit of the local groups in Chapter 7. 


§6. The Galois Group as Module over the Iwasawa Algebra 
Let K, be a number field, K.,/Ky any Z,-extension, with Galois group 
r= {y}, 


with topological generator y. Let Q be a p-abelian extension of K., which is 
also Galois over Ko. For each n we let Q, be the maximal subfield of Q which 
is abelian over K,. 


tl 


Ae 
rd | 
Ko 


The Galois groups are denoted by the letters shown on the diagram. Since Q 
is assumed Galois over Ky and is abelian over K., it follows that the com- 
mutator subgroup is 


Gal(Q/Ky)° = G’~}, 
in other words, it consists of all elements 
o’-1 = ayo 1y71 with o EG. 
It is frequently useful to view G as an additive module over the Iwasawa 
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algebra. Indeed, I, operates by conjugation on Gal(Q/K,), and hence on the 
commutator group 


Gal(Q/K,)”"-1, also written (y?" — 1) Gal(Q/K,). 
Hence 
lim G, = G is a compact module over A = Z,[[X]] = lim Z,[I,]. 
Taking K, as ground field instead of Ky, we obtain mutatis mutandis 
Gal(Q/Q,) = (y"” — DG = (1 + X)”" — 1G. 
Thus in terms of the Iwasawa algebra, we find 
G, = G/(y” — 1G. 
We denote by the sign ~ a quasi-isomorphism of A-modules. 


Theorem 6.1. Let Q be the maximal p-abelian p-ramified extension of Ka. 
Then: 


(i) G = Gal(Q/K..) is finitely generated over the Iwasawa algebra, and in 
fact 


G/G’"1 ~ Z where p = [Ky:Q] — 7, — 1. 
ii) If Ko satisfies the Leopoldt conjecture, then p = re, and 
G/XG ~ Za. 
Proof. By definition, 
29 = M,(Ko), 


and the rank over Z, of a subgroup of finite index in its Galois group was 
determined to be [Ky : Q] — r, in Theorem 5.2. Taking into account I itself 
shows that G/XG ~ Zé where p is as stated. Nakayama’s lemma then proves 
the first assertion, and (i). Part (ii) is then a matter of definitions. 


Theorem 6.2. Assume that Ky is totally imaginary, and that each K,, satisfies 
the Leopoldt conjecture (namely 


1(Kn) = 12(Kn). 
Then there is a quasi-isomorphism 

G ~ A X Gror, 
where Gior is the A-torsion submodule of G. 
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Proof. From the structure theorem, we know that 
Gw~ At x Gor. 
On the other hand, 
ro(Ky) = rap". 
By Theorem 5.2 we know that 
Gal(Q,/K,) ~ ZpP"*}. 


From the structure theorem, one sees easily that this is possible only if t = re, 
as desired. 


The above theorems give a sample of Iwasawa’s results [Iw 12]. It is 
possible to vary some of the hypotheses to obtain variants. For instance, 
one need not assume the full Leopoldt conjecture in Theorem 6.2, merely 
assume that the defect in that conjecture is bounded as function of n. For 
the cyclotomic Z,-extension, this can be proved easily, see for instance Green- 
berg [Gr 4]. 
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Z,-extensions 


In the last chapter we studied the ideal class groups in a Z,-extension of a 
number field. Here we shall consider especially the cyclotomic Z,-extension, 
and then Kummer extensions above it, as in Iwasawa [Iw 12], obtained by 
adjoining p"th roots of units, p-units, and ideal classes of p-power order. 

We give Leopoldt-Iwasawa’s theorem that the Kummer-—Vandiver 
conjecture implies that C™ is cyclic, in a precise version for the cyclo- 
tomic extension of Q, following Kubert-Lang [KL 9]. We prove that the 
Galois group of the Kummer extension obtained by adjoining p-power 
roots of p-units is 1-dimensional free over the Iwasawa algebra. As a 
consequence, we see that C”™ is a quotient of this free module. See 
Leopoldt [Le 5] and the last Satz in [Le 10], as well as Iwasawa [Iw 7], 
Theorem 2. In the limit, there is an analogous (but less precise) statement 
of Greenberg [Gr 4], see also Coates [Co 3], Theorem 5.7. 

For a discussion of the case of totally real number fields, cf. Coates [Co 3], 
[Co 4]. In this connection it is likely that the units conjectured by Stark [St] 
(see also Lichtenbaum’s conjectures [Li 2]) will play a significant role similar 
to the one played by the cyclotomic units, to clarify the situation. 


§1. The Cyclotomic Z,-extension 


Let yp be the group of p-power roots of unity. Then Q(y”) is the composite 
of an extension of degree p — 1 if p is odd, Q(?) if p = 2, and a Z,-extension 
which is uniquely determined as the fixed field of the (finite) torsion group of 
the Galois group, and will be called the cyclotomic Z,-extension. We denote 
it by Z,(Q). It is real. If K is a number field, we let 


Cyc,(K) = KZ,(Q) 
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be the composite of K and the cyclotomic Z,-extension. Then Cyc,(K) is a 
Z,-extension of K. If K is totally real, then this cyclotomic Z,-extension is 
also totally real. 

Suppose on the other hand that K contains the pth roots of unity if p is 
odd, and contains i if p = 2. Let qo be the power of p such that the goth roots 
of unity lie in K. Let 


gn = Gop" and K, = K(u,,). 
Then [K,.1: K,] = p, and 
Ko =k, 
is Z,-extension of K. Let © = Gal(K./Ko) and let 
“4: DP >14+ QLy 
be the canonical representation such that for any p"th root of unity € we have 
he Cm, 


A Galois extension is called p-abelian if its Galois group is a projective 
limit of finite p-abelian groups. We now discuss properties of such extensions 
of K.. which are Galois over Ko. 


For the rest of this section, we assume that Ky contains the pth roots of 
unity if p is odd and iif p = 2. Let A, be a subgroup of K*, and let 


I, = Gal(K,/Ko), An = Zp"). 
We assume that A, is stable under I,,,. 
Ordinary Kummer theory gives a pairing 
Gal(K,(Aq!")/Kn) x An'?"/(An!?” 0 Kp) > bye 
expressed by the symbol 
(6, &) > <6, ®, = oa/a 


for o in the Galois group and « € Al/?". If ye, then 


(07, On = (0, OA = Co, aR, 


where o’ = jo}—', and * is any extension of y to K,(A}/?"). Indeed, 


FoF “Gayfra = 9(F) = »(Z)- 


a 
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We may also write 
Ko, aye = Co, o>, 


The group Ai?" mod Ai'?" ~ K;* has exponent p”, so exponentiating with a 
p-adic integer is well defined. In particular, we may rewrite the functorial 
formula in the form 


(o", On = CG, Oo ns where ig = y*x(y). 
We wish to pass to the limit. We could have taken the Kummer pairing on 
Gal(Kn(Ai”")/Kao) x Ad!?"/(AL?" A Ka*) > Up, 


writing the symbol <a, «> without an index n, defined by the same formula. 
The Galois group on the left can be identified with a subgroup of 
Gal(K,(A}'?")/K,), arising from the change of base of the Kummer extension 
from K, to K~. Let G, be the Galois group on the left, so 


n = Gal(K..(Aj!”")/Kao). 
The field diagram is as follows. 
K.(An”) 
G, | 


K,(Ai!?") 
Ke Se | 
K, 
lr, 


Ky 


The group G, is a ',-module, hence a Z(p")[I’,] = A,-module. Hence via the 
natural homomorphism, it is a A-module, where 


A=lmA, 


is the Iwasawa algebra, isomorphic to Z,[[X]], and X¥ = yo — 1, where yp isa 
fixed generator of I. 
Let 


A = > mX', m,é€ Zy 
be an element of A. We define the Iwasawa involution 
A* = > m,X*, where X* = x(y.)(1 + X)71 - 1. 
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Then X* is also in the maximal ideal (p, X) of A, and 
Ar A* 


is an automorphism of A. The functorial formula for the action of ye I on 
the Kummer symbol can then be expressed in terms of the involution by 


<o7, a> = <a, a", 


for o €G, and « € Al/?"/(Al/?" A K*). 
In the applications, we also pass to the limit on ” for the Kummer pairing. 
We suppose that 


Ay & Ans1 
Let 


Q, = () K.(Ad") and Gy = Gal(Q,/K~). 


We have a compatible system of pairings for m > n: 


Gp x ALP*(ALP™ AV Ka) > Myr 


| | | 


Gy x An'"(An” 0 Ka*) > Hot 


The Galois groups on the left form a projective system, and the Kummer 
groups of field elements on the right of the pairing form an injective system. 
At each finite level, we have a compact-discrete duality. In the limit, we have a 
similar compact-discrete duality 


Gy x lim AL?"/(ALP" A K*) > p 
— 


with values in the p-primary roots of unity. 

The action of A, on G, is compatible in the projective limit, so the limit 
group G, is a topological compact A-module. We shall investigate its structure 
for various systems {A,} obtained from units and ideal classes in the next 
sections. It will also happen that we consider two groups, say 


A> B, 


in which case Q, > Qa. It is clear in each case that Gal(Q,/Q,) is a A-module, 
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and that the Kummer pairings and involution described above also apply to 
this intermediate situation. 


§2. The Maximal p-abelian p-ramified Extension of the 
Cyclotomic Z,-extension 


A Galois extension is called p-abelian if its Galois group is a projective limit 
of finite p-abelian groups. It is called p-ramified if it is unramified at all primes 
(including infinity) not dividing p. We let: 


M,(K) = the maximal p-abelian p-ramified extension of K. 


M2"(K) = the maximal p-abelian unramified extension of K. 


We fix the prime number p and the field K, so we sometimes omit reference 
to them in the notation. 

Even if K is infinite over Q we may define M,(K) and Ms"(K) as above. 
It is then immediate that 


M,(K) = \_) M,(F), 


where the union is taken over a family of subfields F of K finite over Q, whose 
union is K, and which is cofinal with the family of all subfields of K finite 
over Q. For instance, if K is finite over Q, and K.. is a Z,-extension, then 


M,(Ko) = LU) M,(K,). 


A similar remark applies for M3"(K.). 
Throughout this section, we let: 


K.» = cyclotomic Z,-extension of Ko, and we assume that Ko contains 
the pth roots of unity if p is odd, contains i if p = 2. 
Q = maximal p-abelian p-ramified extension of Kx. 
E, = units in K, and E = U &,. 


Qe ae! K..(E"?*) 
A, = group of elements a in K* such that («) = a?” where a is (frac- 
tional) ideal prime to p, and A =  A,. 


Q, = U Ka(Ai'”") 


B,, = p-units in K, = group of elements whose ideal factorization con- 
tains only ideals dividing p. 


Q, =, K..(B"?") 
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We have the following diagram of fields. 


Q 


Qa Qe 


Qs 
K. 
It is clear that Q, and Q, both contain Q,. In fact, both A and B contain E. 


Lemma 1. Q = Q,Q,. 


Proof. By Kummer theory, Q is a composite of cyclic extensions. Let 
K.(a1!®") < Q for some ae Kj. Then «€ K, for some n. We take n > m 
and also such that 


K,(a"/") is p-ramified over K,. 
Then a necessarily has an ideal factorization 
(a) = aP"b, 


where b is p-primary and a is prime to p. Let h be the class number of K,, 
and write h = p’d with d prime to p. Then 


(a") = (a,)?"(B) 
where (a,) = a” and (8) = 6”. Furthermore, 


Kas (acrlP™ a) = Kas (acalP = K, + r(at!?”) 
and also 


Kale’) S Kaa (arirn’, BYP", Be). 


This proves the lemma. 


Theorem 2.1. The Galois groups Gal(Q4/Q,) and Gal(Q,/Q;) are A-torsion 
modules. So Gal(Q/Q;) is a A-torsion module. 


Proof. We shall analyze each Galois group separately, and get a closer 
view of its structure. 
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The extension 2,/Q;. 
Let Gajz = Gal(Q,/Q,). For now abbreviate G4); = G, and let 


G, = Gal(Qg(47'”)/Qz). 


The field diagram is as follows. 


Q(An'”") 
a 


Qe 


| 


Ke 


It is clear that G = projective limit of the groups G,, and that G, is a 
Z(p")[I’,]-module, so in the limit, G is a A-module. 
As in Chapter 5, let: 


C, = Cl®(K,) = p-primary subgroup of ideal class group of K,. 
Then we have a homomorphism 
Aw?” > Cy 
given by 
alr” —> 


if (a) = a?". If wis a unit in Q such that u®" € A,, then uw?" € E,. The kernel 
of our homomorphism is therefore precisely E}/?", so we have an injective 
homomorphism 


AUPE —> Cry 
which is also a A-homomorphism. Let .%, be its image. Then the Kummer 
pairing is isomorphic to a pairing with A,, namely: 
G, x AMP"/ELP™ —> Hy? 
Ge x Sy ———> Hp" 
In addition, this isomorphism is compatible with the limiting process: 


Gnat X Sna1 —> Hpnet 


| | | 


G, X Sn — > [p" 
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Hence we get a compact-discrete duality 
Gx F> pp” 


where »/ = direct limit of 4. By Chapter 5, Theorem 4.4, there exists 
A¢€A such that C? = 1,so C2 = 1 forall n,and A? = 1 foralln. By Kummer 
duality, for o € G we get 


<o*", a> = <o,a*> = 1 for all ce A}/”" and all n. 


Hence o** = 1, so A* annihilates G, which is therefore a torsion module over 
A as desired. 
In addition, we note that the direct limits 


lim Al!?"/E}/?" = lim % and limC, = C, 
> > —> 


are equal since any element in C,, has a representative ideal prime to p. 
Consequently we get the additional information: 


Theorem 2.2. The Kummer pairing gives rise to a compact-discrete duality 
Gal(Q,4/Qz) x Co > pw. 


Remark. Iwasawa has also shown that C = lim C, is quasi-isomorphic to 
<_ 
Hom(lim C,, Q,/Z,) (see Theorem 11, p. 266 of [Iw 12]). 
—> 


The extension 2;/Q;. 


Let Ggjz = Gal(Q;/Q;). For now abbreviate Gz,; = G. By the Lemma of 
Chapter 5, §1 we know that there is only a finite number of primes ,,..., p, 
dividing p in some finite extension Ky, such that »,,..., p, are totally ramified 
in K,,. Let h be the class number of K,. Let 


pl = (m),---, Ps = (75). 
Then 

Q, = Q,(ni!?",..., m2/?”). 
It is immediate that 


Gal(Q;/Q;) ~ Zs with s’ < s. 


In particular, the structure theorem for finitely generated A-modules implies 
that G cannot have any free part, so is a A-torsion module. This proves 
Theorem 2.1. 
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For additional information concerning the fixed field of 
Gal(Q/K tor 


(for referring to A-torsion), cf. for instance Coates [Co 1], Theorem 5. Iwasawa 
has an example showing that there are cases when the fixed field is not 
necessarily Qz,. 


Theorem 2.3. Assume that there is only one prime in K. lying above p. 
Then 


Q; = Qz, = Qs 
where E, is the group of p-units in K., and 

Qe, = Q(E5'?*). 
Proof. We consider the diagram of fields: 


K,, 


a aes =Q, 


K 


Q 


The ideal above p in Q, is principal, say generated by the element A, = 
1 — ¢,. The degree [K,, : Q,] is bounded independently of n, and we have 


Cn) = Pym 


where e, is the ramification index, bounded by this degree. Taking e to be the 
least common multiple of the integers e, shows that p§ is principal for all n. 
We apply this to the previous discussion of the extension Q, = Q,(n}!?”). 
As ideals we have 


(m1) = pin 


for n sufficiently large, and j, is divisible by arbitrary large powers of p as 
n—> oo. Furthermore 


Q, = Q,(nf?*). 


It is then clear that Qg = Q,. 
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§3. Cyclotomic Units as a Universal Distribution 


Let p be a prime number. 
Let &, be the group generated by +p (p-power roots of unity) and by 
the elements 


¢-—1, withf€eu™ andl #1. 


We call &, the cyclotomic p-units. They satisfy the following relations: 


CU 1. giC-N)=-C¢ - 1) 
CU 2. Tl@-)D=@-1 if@ 41. 
nP=l 
CU 3. II] ¢€-D=p. 
¢bhmidive 


For this last one, note that the pth roots of unity satisfy 
XP-t 4... + 1=0. 


Replacing X¥ by X + 1 yields the equation for € — 1, where € is a pth root 
of unity. The constant term is then p. Replacing X by X*"~* yields the 
equation for the general case, proving CU 3. The other properties are obvious. 

We may rewrite these relations to fit the formalism of distributions as 
follows. Let a € (Q/Z)™ and a # 0. Define 


Zo = e272 — |, 
Let V = &,/+p™ be the factor group of cyclotomic p-units by roots of unity. 
Theorem 3.1. The association of (Q/Z)® — V given by 
at> g, (mod roots of unity) 
Satisfies the distribution relations except at 0. 


The theorem means that for a 4 0 we have 


TI &> = 8a» 
pb=a 
and is obvious in the light of CU 2. 


Let V, = &,n/+ Up" be the factor group by roots of unity of p-units at 
level <n, i.e., generated by the roots of unity, and the elements € — 1 where 
¢ is a p"th root of unity #1. Whether we take ¢ to be primitive or not to 
generate V, is immaterial since the distribution relation shows that we get 
all of them from the primitive ones. 
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Let Y = Gal(Q(u,")/Q) mod o_,. The next theorem is due to Bass [Ba]. 


Theorem 3.2. The group Yi operates simply transitively on the primitive 
elements of V,, and the induced homomorphism 


Z[Gi]—> V,, such that o,+> Seip 


is an isomorphism. 


Proof. The homomorphism is obviously surjective. It is injective because 
Z[Y, ] is torsion free, and the ranks of the two groups are equal. This proves 
the theorem. 


Theorem 3.3. The factor group V,,/V, for m > n has no torsion. 


Proof. The embedding of V, into V,, corresponds to the embedding of 
group rings 


ZF. ] > Z[Fa] 
which sends an element o, on the element > o,, where the sum is taken over 
6, € Gn (c), the set of elements in Y{ which project on o, under the canonical 
map 
Git—>Gs., 
If an element 


> Dd kO)o, kb) eZ, 


c beg - (c) 


is a torsion element with respect to Z[G,*], then all the coefficients k(b) for 
be Gi(c) must be equal to each other, and hence the element already lies 
in Z[Y," ], as was to be shown. 


Analogues of Theorem 3.2 and 3.3 in the modular case are proved in the 
Kubert—Lang series [KL 2, 3, 4, 5]. In that case, it is also shown that there are 
no units except the modular ones. Here in the cyclotomic case, say for the 
p-primary component, it is the Kummer—Vandiver conjecture whether the 
factor group E/& is without p-torsion. 

For the rest of this section, it is convenient to use &, to denote the proper 
group of cyclotomic units, i.e., the group of units of the form 


n*, 


where z = ¢ — 1, ¢ is a primitive pth root of unity, and a = Dyegip%) K(b) oy 
is an element of Z[Y,], of degree 0, i.e., 


> k(b) = 0. 
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Theorem 3.4. Let p be odd, and let c be a primitive root mod p?. Then 6, 
is generated over Z[Y,]o by the element 


= ~f-1 
ae aaa aera 


Proof. We write an element « of degree 0 in the form 


a = > kb) — 1), 


and observe that o, — 1 is divisible in the integral group ring by o, — 1 
because o, is a generator of the cyclic group Y,. This proves the theorem. 


For p = 2 one has an analogous result using for c an element =1 mod 4 
such that c generates 1 + 4Z,. The group Y¥, = Gal(Q(,*)/Q) is not cyclic 
but a product of a cyclic group of order 2 and Y,', which is cyclic. 

It is convenient to reformulate the above theorem by passing to Y,’. 


Theorem 3.5. Let c be a generator of 1 + 4Z, if p = 2, and a primitive 

root mod p? if p > 2. Let 

: cil, 
v, = image of et in V,, 

and let V2 be the subgroup of V,, represented by units (not just p-units). 

Then we have an isomorphism 


Va & Z[GFr Jorn, 


so V2 is free of rank | over Z[Gx ]o. 


Proof. Clear. 


The composite case. 


Let V be the group of cyclotomic units of all levels, modulo the group of roots 
of unity. Then V is torsion free. 

Let c = (..., Cp,...) be a vector with a component c, € Z* for each p 
such that c, is a primitive root mod p? if p is odd, and cz = 1 mod 4 and 
generates 1 + 4Z, if p = 2. We have an associated automorphism o, on the 
full cyclotomic extension of Q. 

Let ae Q/Z and a¢ Z. We define 


a,(e2"!2 és. 1) 
euia — J] 


h(a) = image in V of 


Then A is an ordinary distribution in the sense of Chapter 2, §8 if we define 
(0) = 0. 
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Theorem 3.6. This distribution is the universal even ordinary distribution with 
value 0 at 0, and values into abelian groups on which multiplication by 2 is 
invertible. 


Proof. On (1/N)Z/Z the group generated by the image of / has rank 
3|Z)*| — 1, 


which according to Kubert’s Theorem 9.1(iii) of Chapter 2 is the maximal 
possible rank (the value 0 at 0 gives rise to the —1). The Kubert generators 
in T,/+1 must therefore be free generators, and the canonical map from the 
universal distribution to A must be an isomorphism. 


The above is more or less Bass’ theorem in a different formulation. (Also, 
as Bass states it, there is some difficulty with 2-torsion.) The idea of interpret- 
ing it in terms of the cyclotomic units forming a universal distribution is due 
to Kubert-Lang [KL 3], where a similar result is proved for the modular 
units. The essential step in the proof here is of course Kubert’s theorem cited 
above, combined with the independence of the units. In the cyclotomic case, 
this comes back to the non-vanishing of the regulator, i.e., L(1, x) # 0. In the 
modular case, see [KL 2] and [KL 5]. 


§4. The Iwasawa—Leopoldt Theorem and the Kummer—Vandiver Conjecture 


For simplicity throughout this section we assume that p is an odd prime. Also 
throughout this section, we let: 


Kx. = Q(u™) and Ko = Quy); 
G = Gal(K./Q), 
G, = Gal(K,/Q), 
G* = GY, mod o_, as in the preceding section. 
R, = Z(p"{Y,], and R = projective limit of R,. 


Remark. It is easy to see that 
Rx A[G] 
where A is the usual Iwasawa algebra. 


hj = class number of K;. We assume the Kummer—Vandiver conjecture 
that hx =(E,:6&,) is prime to p. 


Q, = K,(Vjl?") = K..(&)/2”"). By the Kummer—Vandiver conjecture, 
Q, = Ko(Epi’). 
Q = UQ, = Ko(E5?°) = Ko(@5'?”). 
G, = Gal(Q,/K.) and G = Gal(Q/K.). 
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We shall now develop the theory of G as R-module follow'ng the exposition 
of [KL 9]. We use an upper minus sign to denote, as usual, the (— 1)-eigen- 
space. This applies for instance to R~, G~, etc. 


Theorem 4.1. Assuming the Kummer—Vandiver conjecture, we have G = 
G~, and G is a 1-dimensional free module over R-. 


Proof. By the Kummer—Vandiver conjecture and Theorem 3.3 we have 
En KS" = Ep. 
Let us abbreviate for simplicity 
yale" — EMPIRE, 0 KB. 


Then the Kummer theory pairing discussed in §1 can be described more 
explicitly as follows. From Theorem 3.2 we write an isomorphism 


5: Dol Esl Fal* > yarn, 


using formal linear combinations with coefficients in (1/p")Z,/Z,. We have a 
model for Kummer duality, through the pairing 


Z(pGal 5 Z.p[ Zl Gn) —> bye 


such that 
> x(c)o, x > ylc)o, He ert & merucere 


This pairing induces a perfect duality 
1 
LZ(p"MF,]- x Pp Z,/Z[F,)* > Ups 
as follows immediately from the formula 
KAP, GP) = CA, )? 
where p is complex conjugation, or for that matter any element of G,. 
Therefore we have an isomorphism of the Kummer pairing in terms of the 


group rings, 


G, x Vil" —> psn 


I | | 


ey) | 
R, x — Ri —> pp 
Pp 6 Hp 
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and this isomorphism is compatible with the limiting process, which can be 
represented by a diagram in terms of the group rings for m > n: 


Rn x pm Rn —> Cp™ 


| [| 


Re x oe RL > ty 
It is then clear that 
G =limG, ~ lim Rj = R-, 
as desired. 


One would expect the units whose existence is conjectured by Stark [St] 
to play a similar role over totally real fields. 


Theorem 4.2. Let C, = Cl®(K,) be the p-primary part of the ideal class 
group of K, and let C = projective limit of the C, under the norm map. 
Under the Kummer—Vandiver conjecture, we have C=C, and C” is 
cyclic as a A-module. In fact, the maximal unramified p-abelian exten- 
sion of K,, is contained in Q, so we have a natural surjective map 


G-> Go ~~ C, 
the first map by restriction and the second by class field theory. 


Proof. The field diagram (once the theorem is proved) is as follows. 
Q 


Pe. 


What we have to do is to show that the maximal p-abelian unramified exten- 
sion of K,, is in fact contained in Q. The rest of the theorem is then obvious 
from Theorem 4.1. It will suffice to prove that a finite cyclic unramified 
p-abelian extension of K,., is contained in Q. 

Let K.(a) be unramified, with some element « such that «” lies in Ka. 
We first show that we may select « to be real. By Kummer-Vandiver, we 
have Gt = 1 so Gc = Gc. Let o be a generator for Gal(K,,(@)/K,.). Then 


oa = Ca for some pth root of unity ¢. 


Let p be complex conjugation. Then pop! = pop = a7} sineeG=G. 
Hence popa = €~1a, and therefore 


ot = Ca, 
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so that o(&/a) = &/«. Thus &/a = b lies in K,.. But the norm of b from K. 
to Kz is 1 (obvious), so by Hilbert’s Theorem 90, there exists 8 € K.. such 
that B/B = b. Then af is real, and K..(a) = K.(af). This shows that we may 
assume « real. 

For n sufficiently large, «* lies in K,j*, and K,*(«) is unramified over K,* 
because p is odd. Hence we have an ideal factorization 


(@) = o* 


for some fractional ideal a in K,*. The class of this ideal is principal by 
Kummer-Vandiver’s conjecture. It is then immediate that 


K,(a) = K,(u"?") 

for some unit u, thereby concluding the proof. 

In the next theorem we let 

Pingn = Gal(Ky/Kn)- 
Theorem 4.3. (i) For m > n we have an injection 
Ker(C, > Cy) > WH? nas En): 
(ii) The Kummer—Vandiver conjecture implies H* (Inn, Em) = 9, SO 
Cri —> Cn 
is injective. 
Proof. Let a be an ideal representing an element of C,, becoming principal 


in K,, say a = (a) with ae K,. For any element oe I,,, we have oa = a. 
Hence oa is equal to a times some unit. The association 


at+> cocycle class of (aa/a) 


is a homomorphism of Ker(C, — C,,) into H+ of the units, which is immedi- 
ately verified to be injective. 

Assume now the Kummer—Vandiver conjecture. Let &,, be the group 
of cyclotomic units. Then E,,/&,, has order prime to p, so 


H(6,) = H*(En). 
For simplicity let W,, = y,»+1. Then we have exact sequences 


0 > Wa mn —> Vn > 0 
and 


0> 6, > 6m > Z>0 
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whence exact cohomology sequences 


0 H*(Wp) > H*(65,m) > H*Vin) 
and 
0 H*(E,) > H(G,n) > HZ). 
Since H}(Z) is trivial and H+(V,,) is trivial (by elementary facts of cohomology 
of finite groups, and Theorem 3.2), it will now suffice to prove that H1(W,,) 
is trivial. By the theory of the Herbrand quotient (cf. for instance Chapter IX, 


§1 of my Algebraic Number Theory), the orders of H*(W,,) and H°(W,,) are 
equal. However, 


H(W,,) = Wit!Nnn Wn ie W,1NanWn- 


where N,,,, is the norm. Thus finally it suffices to prove that every p-power 
root of unity in K, is the norm of an element in W,,. Let ¢ be a generator of 
W,,. The elements of the Galois group are represented by p-adic integers of 
the form 


1+ xp"*? with x € Z/p"~*Z. 


Taking the norm yields 
Nast =] [ot = or 


which is a primitive element in W, and thus shows that H°(W,,) is trivial. 
This concludes the proof of the theorem. 


Let K be a number field and let K’ be an abelian extension with Galois 
group G. We assume that K and K’ are stable under complex conjugation. 
We say that the extension K’ of K is odd (resp. even) if its Galois group is in 
the (— 1)-eigenspace (resp. the 1-eigenspace) for complex conjugation. 


Lemma. If «?"** € K,*, then K,(«)/K, is an odd extension. 


Proof. Clear. 


From the lemma, it follows that Q,/K. is an odd extension, because the 
units are generated by real cyclotomic units and roots of unity. 


Theorem 4.4. Under the Kummer—Vandiver conjecture, the Kummer duality 
gives rise to a compact discrete duality 


Gal(Q/Q;)* dual to Cz 
and also 
Gal(Q/Q")* dual to Cs. 
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Proof. By Theorems 2.2, 2.3 and Lemma | of §2 we know that Q = Q, 
and that 


Gal(Q/Q;) is dual to C.. 


Taking eigenspaces for complex conjugation yields the first assertion. As to 
the second, we know from Theorem 4.2 that 


quer (em Qy 


and Q, is an odd extension of Q"". Again considering the eigenspaces yields 
the second assertion. 
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Iwasawa Theory of Local Units 


Iwasawa [Iw 8], [Iw 10] developed a theory of local units analogous to the 
global theory, taking projective limits, especially in the cyclotomic tower, and 
getting the structure of this projective limit modulo the closure of the cyclo- 
tomic units. He considers eigenspaces for the characters of Gal(K,/Q,) where 
Ky = Q,() with a primitive pth root of unity ¢. Since the cyclotomic units are 
essentially real, we consider only even non-trivial characters. Then the eigen- 
space is isomorphic to A/(g), where g is a power series which is essentially the 
p-adic L-function. 

The first section deals with the classical Kummer-Takagi exponents at the 
first level Q,(¢), where € is a primitive pth root of unity, p odd. This is used in 
combination with Nakayama’s lemma afterwards to get corresponding results 
in the cyclotomic tower. Throughout this chapter we assume that p is odd. 

Coates-Wiles [C-W 4] have extended this theory to the case of elliptic 
curves with complex multiplication. In the process they have found sub- 
stantial simplifications for Iwasawa’s proofs, and the exposition of this 
chapter is essentially due to them. Note especially their generalization of the 
Kummer homomorphism to all levels—a key to the whole theory. Such a 
homomorphism extends to other formal groups besides the multiplicative 
group, and a quite general statement has also been given by Coleman [Coll]. 

On the whole, this chapter may be viewed as giving a good introduction 
to the theories of Coates-Wiles. I am much indebted to them for keeping me 
up on their work. 


§1. The Kummer-Takagi Exponents 


Let € be a primitive pth root of unity, where p is an odd prime. Let Ky = 
Q,(0). We let 0, p be the integers and prime ideal of Ky respectively, and 
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let 
m= C€-1. 
Let U, be the group of units =1 mod p in Ko. We let Gp = Gal(K,/Q,), and 
Xo: Go > Up-1 © ZK 
be the homomorphism such that 
C7 = (%™, force Go. 


For simplicity of typography in this section we shall write x instead of xo. 
Let fe Z,[[X]]. We recall the variables 


T=14+X=é, 
and the differential operator 
D = d + X)Dx — D;z = TDy. 


This last equality holds only for rational functions of X (or 7). 
Let ue Uy so u = 1 mod yp. Let f be a power series =1 mod (p, X) such 
that 


u = f(x). 


We then say that fis a power series associated with uv. Such a power series is 
well defined up to a multiple of the irreducible polynomial ACX) of z over Z,. 
Let f, fi be associated with u, so f = f, mod A. Since f, is a unit power series, 
there exists a power series g such that 


S=f:0 + gh) with ge Z,[L[X]]. 
Then 


gh’ + hg’ 


Pip=filh + oF 


and 
Dflf = Dfi/fi + multiples of A and h’. 
Since / is an Eisenstein polynomial, it follows that 


D*~*(Df/f)(O) [= D* log f(0)] is well defined mod p for 1 < k < p — 2. 
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We define the Kummer homomorphism for these values of k by 
pdu) = D*~*(Df/f)(0) mod p. 

It is indeed clear that 

Mx: Ur > Z(p) 
is a homomorphism. By the change of variables X = e? — 1, the formula 

D*~*(Dfif (0) 
is also valid for f as function of Z, i.e., if we set 

I(X) = fe lZ) 
then 

D¥-*(Dflf)(0) = Dz (Dz fe,lfe,)O).- 


We now develop systematically certain properties of the Kummer homo- 
morphism. These will be extended in the Coates-Wiles manner later to all 
levels. 


K 1. If f,, fo are associated with units uy, Us, then f, fz is associated with 
UjUy. If f is associated with u and a€Z,, then f(X)* is associated 
with u®. 


Proof. The homomorphic property is clear, and has already been mentioned. 
The statement for a € Z, follows from positive integers by continuity. 


K 2. If f is associated with u, then a power series associated with u° is 


fi + xy — 1). 
Proof. If u = f(x) and f = 1 +---, then 


ul = fl + my — 1). 
So the property is obvious. Furthermore, 
fl + XY — 1) = fle? — 1). 


The next property then follows from the chain rule in terms of the 
variable Z. 


K 3. Q(u’) = x(o)*o,(u). 
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Let y be a character of Go. Let 


ee 


eX) => D Kode 


aéGo 


be the corresponding idempotent in the group algebra Z,[G,]. Write y = x* 
for some residue class « mod p — 1. Let u = 1 mod p. Put 


ue® = u(y). 
K 47). [fk = a mod p — 1 then o,(u(y)) = ¢,(u). 


K 4). fk 4 a mod p — 1 then ¢,(u(y)) = 0. 
Proof. By K 2 and K 3 we find 


Pu) = x*(e(y)) (4). 
The property follows by orthogonality of characters. 
The units 
1—x* fork =1,2,... 
generate U, topologically. We shall be especially interested in the values of k 


satisfying 1 < k < p — 2, and we shall orthogonalize these units with respect 
to the characters of Gp. We let 


ne = (1 — mye 


where we abbreviate 


elk) = efx") = pea > Moe. 


Lemma. We have n, = 1 — x* mod x**}, for1 < k < p — 2. 


Proof. We have 
ne =[ [GQ - @ — 2)-*-* mod 2?-? 
=[]@+ x" - 1). 
Say (° = £*. Then 
x *(a)(E7 — 1)" = a-*(E* — 1) = ({ — 1)* mod 27-3, 
as was to be shown. 
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Theorem 1.1. Let 1 < k,j < p — 2. 

Gi) gn.) = —k mod p. 

(ii) olny) = 0 if k # j. 

Proof. The second assertion is a special case of K 4(ii). As to the first, 
Px(Mx) = Px(l — 7%). 


An associated power series of 1 — x* is f(X) = 1 — X*, and 


FCO = 
Then 
Dfff(X) = —k(1 + X) > Xe 
= —k e2 > (e2 — 1)’* 
= —k e* mod Z*. 
Hence 
D*~*(Df/f)0) = —k 
as desired. 


By the lemma, and a trivial recursion procedure, any unit =1 mod p has 
a product expression 


u = nii---nizz2 mod n?-, 


and the exponents t, are called the Kummer-Takagi exponents. They are well 
defined mod p. 


Theorem 1.2. Let u be as above. Then 
1 
t, = = @ {u) mod p. 


Proof. Immediate, from K 4 and the fact that @, is a Z,-morphism. 


The Kummer Generators 


The rest of this section will not be needed, but is included for completeness 
of reference, and as an introduction to [C-W 2], [C—W 4]. It is convenient to 
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phrase the results in terms of the Lubin-Tate formal groups, so for the rest 
of this section, we assume that the reader is acquainted with the basic facts 
of these groups as explained in §1 and §2 of the next chapter, as well as the 
existence of the logarithm on such groups, as explained in §6 of the next 
chapter. 

Let A be a Lubin-Tate formal group over the p-adic field K, and associated 
prime zx. We let B be the basic Lubin-Tate group associated with the 
Frobenius polynomial 


X7 + 2X, 


Let W be the local parameter on B, and Z the parameter on the additive group, 
so we have 


Z =2,(W) = Wmod W?2-1 


by Lemma 2 of §6 in the next chapter. 
Let 


8(W) = bo + bW+--- 
be a power series with coefficients in K, and let 
&a,(Z) =d+daZ+--: 


be the power series obtained by putting g,(W) = gq,(As(W)). Then it is clear 
that 


b, = d, fork =0,...,q—1. 
Taking the logarithmic derivative, i.e., the operator 
gt>g'lg 


for any power series g, we then obtain: 


Lemma. /f 
82/82(W) = > Cx,pW*-? 
ea4 
86a/8a,(Z) = > CkGgl"* 
ka 
then 


Ck,B = Ck,Ge fork — ly; . +9 = 2. 
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Proof. This is trivial from the chain rule, writing 
&(W) = ga,(W + O(W2?)). 
Let wo be an element of B, such that 
wel+2=0. 
The field 
Ky = K(wo) 


is tamely ramified, with different p%- . 
Let u be a unit in Ko, and let g ¢ o[[W]] be a power series such that 


u = g(Wo). 
If g, is another such power series, then 


g(W) = g(W)K(W) 


where /A(W) is the irreducible polynomial of wo over K. From this it is 
immediate that 


g'/g(Wo) is well defined modulo §~ ?, and lies in o. 


In particular, if we write 
g'/g(wo) = > c.w6-*, with c, € 9, 
K=1 


then c, is well defined mod p for 1 < k < q — 2. We define 
y,{uy=c, forl<k<q-2. 
Then it is clear that 


Px? 93 > 9/P 


is a homomorphism, which we shall call the Kummer homomorphism of degree 
k. We shall determine the value of this homomorphism in special interesting 
cases. 

There is a character y of Gp into y,-1 such that 


oWo = X(C)Wo. 
To avoid technical complications, we now assume that K = Q, so that 
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o = Z,. We let 


oe 7 D>, ko "oe 


P o€Go 


be the idempotent in the group ring Z,[G,] for the character y*. If u is a 
unit =1 mod wo, we can define 


ub with teZ, 


in the obvious manner. We pick a sequence of integers m € Z approaching t 
p-adically, and the ordinary powers uw” approach a limit, which is by 
definition wu. 

The units 


1 — wé, withk = 1,2,... 
form a topological system of generators for the units =1 mod 9 in 09. Let 
Go = Gal(Ko/K), S80 Go © Uy-1- 
We orthogonalize a basis for the units. We let 
ne = (1 — wh), fork =1,...,p—1. 
Then: 
(i) n, = 1 — wk mod pk*}. 


(ii) on, = nk", i.e., me € Uo(k), where Uo(k) is the y*-eigenspace of Up, and 
U, are the units =1 mod pp in Ko. 


The second statement is obvious by the standard properties of the idem- 
potent e,. For the first, we simply expand the product 


Nh = Il (1 — wk)-*-"* mod wkt! 


=[T[d - x@)-* 
= (1 + wh)? 


=1—- wk 
as was to be shown. 
Theorem 1.3. Let j,k = 1,...,p — 2. Then: 
(i) pny) =0 ifk #j. 
(ii) p(n.) = —k mod p. 


Ifuis a unit =1 mod {, and 
u = ny: + -nP=2 mod w8-}, 
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then 


1 
t, = = @,(u) mod p. 
Proof. Taking the logarithmic derivative formally, we have: 


d log 
dwo 


dlog mee 
Me = “Fog iM (1 — x*(a)wh)-# 
nay —kxtt(ows7? 
= me k, 
a ere 
=k (Gg) wkd+D-1, 
2, 2 x(o)wk 


For j = 0 we get a term k(p — 1)wk-1 = —kwk-1 mod w§~2. This shows that 
9:(n,) = —k. On the other hand, if k(j + 1) — 1 < p — 3, we have 


kj+)sp-—2 so kj <p-3. 
Then 


x4 is not trivial, 


so the orthogonality relations show that the coefficient of the corresponding 
power of w, is 0. This proves (i) and (ii). The last assertion then follows from 
the homomorphic property of the map ¢,, thus proving the theorem. 


Let . = {a} be a finite family of integers prime to p, and let Y = {n,} 
be a finite family of integers satisfying 


[ [a = 1modp and yn = 0. 


u = u(t, 4) =T] (a = 1). 
Then u is a cyclotomic unit, and u = 1 mod fo. 
Theorem 1.4. Let 
u(t, NV) = t+ - -nPz3 mod w§-*. 


Then 


pos > nat mod 
k= Ke 4G Dp. 


where B,, is the Bernoulli number. 
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Proof. Let A be the formal multiplicative group, B the special Lubin— 
Tate group associated with it. The power series 


8e(Z) = 7 — 1 
corresponds to the power series g,(W) such that 
B_(Wo) = C7 — 1 


where ¢ is a pth root of unity. Directly from the definition of the Bernoulli 
numbers, 


Z — zk 
a oa 2, “ kl} 
it follows trivially that 
i ae 7 
8ala(Z) = 4+ > 7p Beak Z*. 
k=0°°° 


Since the operation g+> g’/g sends multiplication to addition, the theorem 
follows from Lemma 1 of §6 in the next chapter, and Theorem 1.3. 


§2. Projective Limit of the Unit Groups 


Let: 
K, = Q,(W,), W,, = Ppt}. 
Dn» P, = integers and maximal ideal in K, respectively. 
U,, = units =1 mod by, in K,. 
U, = units whose norms to Q, are equal to 1 


units which are infinitely divisible in the projective system of 
units under the norm maps N»,, with m > n. 


ll 


We have given two conditions describing U,, and it is easy to prove that they 
are equivalent. Indeed, we have the formula for the norm residue symbol: 


(u, Kn/Kn) cami (N,u; Kn/Qp) 


where JN, is the norm from K, to Q,. If N,u = 1 then w is a norm from K,, 
for every m. Conversely, if the left-hand side is 1 for all m, then 


N,u = 1 mod p™ 
for all positive integers m, so N,u = 1. This proves the equivalence. 
Let: 


Q 
3 
ll 


Gal(K,/Q,) and G. = Gal(K./Q,). 
= Gal(K,/Ko) and I = limI, = Gal(K./Ko). 


3 
| 
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Note that Gp operates on K,, U,, U, and 
G, ~ I, xX Go, while G, ~I x G). 


Let y be a topological generator of I’. Then r'?" = Gal(K.,/K,) is generated 
by y?". 


We have an exact sequence of Galois modules 
1—> Ui — U,, “> subgroup of Z* > 1. 
From this sequence we conclude that for each character y of Gp and x # 1, 
U,G) © Ux(x)- 


In the next lemma, by rankz, we mean (as usual) the rank of a module modulo 
torsion over Zp. 


Lemma 1. (i) rankz, U,(x) = p”. 
Gi) If xy # 1, %o then 
Ux) » ZP”. 
Proof. The integers 0, contain a free submodule over the group ring 
Z,[Gal(K,/Q,)], 
and for large r, 1 + p’o, is Galois-isomorphic to the above submodule under 
the exponential map, and is contained in U, with finite index, so the first 
part of the lemma is clear. 


For the second part, the only torsion in U, consists of the roots of unity 
W,, which is a %o-eigenspace. Hence for y # 1, %) we have an isomorphism 


U(x) = ZP” 
as desired. 


We consider the groups U, as forming a projective system under the norm 
maps, and we let 


U = lim U, 
<— 
be the projective limit. Then from the definition of U, we see that also 
U = lim Uj. 
<_ 
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Note that U is a topological, compact Z,-module, and also a A-module, where 
A = lim Z,[I,]. 
< 
If u is an element of U, then we view uw as a vector 
u=(...,Up,---) 
with components u, € U, such that Ny nln = Un form = n, and we also write 
u = lim u,. 


Lemma 2. U has no Z,-torsion. 


Proof. Otherwise there exists a fixed power p’ and an element u = lim u, 
such that uv?” = | foralln. Then wu, isa root of unity, and ifu, 4 1 for some n, 
then the order of u,, becomes arbitrarily large as m becomes large, which is 
impossible. 


Theorem 2.1. For each character y # 1, %9 of Go there is a A-isomorphism 
U(x) = A. 
In other words, U(y) is free of dimension 1 over A. 


The proof will occupy the rest of this section, and will result from a 
sequence of lemmas. A “natural” basis element for U(y) over A will be given 
in the next section. 

We shall apply Galois and class field theory in a manner similar to the 
global case. For simplicity of notation, if X is a '-module, we let: 


Xiny = X/[(y?"” — 1I)X and X™ = fixed elements under y?". 

For simplicity of notation, throughout this section, denote by K2" the 
maximal p-abelian extension of K,, and similarly let K2? be the maximal 
p-abelian extension of K,,. We have a tower of fields 

K, © Ko © K2° < Kee. 
Recall that F operates by conjugation on Gal(K2°/K.). 


Lemma 3. If y # 1 then we have isomorphisms 


Gal(K2*/K.)(y) © Gal(K2"/Ka)en() 
~ Gal(K?/K,)(x). 
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Proof. This is clear from the fact that K2° is the maximal abelian extension 
of K, contained in K3°, together with the exact sequence 


0 — Gal(K?"/K..) > Gal(K*"/K,) > Gal(K./K,) > 0 
together with the fact that the last term is ¥ Z,. 


The Galois group Gal(K?"/K,) is isomorphic by class field theory with the 
completion of K** under the topology of subgroups of finite index. There is a 
topological isomorphism as abelian groups 
K* = Z x o%. 

Given a choice of prime element x in K,, the isomorphism has the form 
K# = 22 x o*. 

The completion of K; in the topology of subgroups of finite index is therefore 
K* = n® x o* 


as abelian groups (not Galois modules), where 


Z= vy Z, (product taken over all primes /). 
1 


On the other hand we have an exact sequence of Galois modules 
1 > U, > KP /tp-1 > Z 0. 
Since Gy operates trivially on Z, for each y # 1 of Gy we have an isomorphism 
U(X) © (KF /by (2). 
The isomorphism of local class field theory 
Ky /dp-1 © Gal(Kp?/K,) 


preserves the [ and G, structures of both groups. Passing to the projective 
limit over n, it follows from the previous isomorphism that 


U(x) = lim Un(~) & Gal(K2"/Ka)(), 
whence we obtain the next theorem from Lemma 3. 
Theorem 2.2. For x # 1 we have an isomorphism 
U.Q) = UGG?" — DUG) = Um. 
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Lemma 4. Let M be a finitely generated A-module such that 

M/(y"" — 1)M 
is free over Z, of rank p" for all n. Then M is quasi-isomorphic to A. 
Proof. Obvious from the structure theorem in Chapter 5. 


The lemma is applied to the unit groups, using Lemma | and Theorem 2.2. 
We therefore conclude that there is an exact sequence of A-modules 


0>A>UYy)-A> B-0 


where A, B are finite. Since U has no Z,-torsion by Lemma 2, it follows that 
A = O. The next lemma will conclude the proof. 


Lemma 5. In the exact sequence, we have B = 0, for x # 1, Xo. 
Proof. From the exact sequence 
0>U(y)- A> B->0 
we get the exact (cohomology) sequence 
0 > Uy) + AM => B™ > UD» 


[This is no big deal in the present instance. The last map is obtained by taking 
an element b € B™, lifting back to any c € A, and sending c t-> (y?” — 1)c. This 
is well defined in U(y)/(y?" — 1)U(), and the sequence is trivially verified to 
be exact.] 

Trivially AM = 0. Hence we obtain an injection 


0 > B™ + Un ~ Un(y) by Theorem 2.2. 
But U,(y) has no torsion by Lemma I(ii). Hence 
Bm =0 for all n. 


Since B is finite, this implies that B = 0, and proves Theorem 2.1. 


§3. A Basis for U(y) over A 


For each n > 0 we let W,, = ppn+1, and we fix a family of primitive p**+th 
roots of unity ¢, ¢ W, such that 


Chat = lone 
We let 
Xn = ln 1. 
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The notation remains that of the preceding section. If o € G,, then there is 
an isomorphism 


%: Go > Ze 
such that for all n, 
C= ORO. 
As before, we let 
%o: Go > p-1 


be the corresponding isomorphism at the first level. If y is a character of Go, 
with values in y,_, then 


X = Ho 


for some k determined mod p — 1. 
Given y # 1, % we shall construct an element & € U such that the element 


ay) = ee 


is a basis of U(x) over A. It is natural to construct &, of the form 


fn = w(b)~*(b — xn) 


where 6 € Z*, and w is the Teichmuller character. We have divided by w(b) 
so that €, = 1 mod p,. For each n > 1 we want that 


Nan-1(b — Xn) = b — Xqn-1. 
But x, is a root of (1 + X)? = ¢,_1 so the equation for b — x, over K,_, is 
CLD YP ety yO, 
and from the constant term we see that 
(1 + Bb)? — Cy-1 = Nan-106 — a) = 6 — Xy-1 = b — (C,-1 — D. 


Thus 
d+6)=1+06, so (14+ 5)"1=1. 


Therefore we select any 
b=A-—1, withany AA1 and dJep,-_, 
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to get the desired b. A choice of 4 determines such €, and we write 
é€# instead of €, 
if we wish to emphasize the dependence on A. Since 
a(b)? = a(6), 
it follows that for n > 1, 
Nn,n-10(b) = (6)? = a(0), 
and so the elements €, form a projective system in U. 


Lemma 1. Given y = x§ # 1, xo there exists A€ u,-, such that if we let 
b=A-— 1, and 


fo = EY = w(b)-(b — x0), 
then: 


(i) (Eo) # 0 mod p; 


Gi) Eo(y) = e(y): Eo generates U(x) over Z,. 


Proof. We shall check below that for a suitable choice of 2 (depending 
on k) the Kummer-Takagi exponent given by Theorem 1.2 is 4 0 mod p. 
Then €,(y) generates Uo(x)/Uo(x)”, and hence generates Uo(y) over Z, by 
Nakayama’s lemma. 

Now for the computation of the Kummer—Takagi exponents, we need only 


compute @,(€)) by K 4. We have 
fo = 9 (1 — xb) 
° wb) edd 
The associated power series is 
FU) = 2 (1 — Ib) 
~ o(6) : 


Then 


FO = -5 yp = - 5D, 0 


We want to prove that 
D*~?*(Dfif) # 0 mod p. 
181 


7. Iwasawa Theory of Local Units 


We have 


Dif = (+ vf) =44 21454, 


But D = TDy. Hence it will suffice to prove that for 2 < k < p — 2 we have 


1 
k-1 
(TDz) (+ lees # 0 mod p. 


By induction, it is immediately shown that 


1 TA"™-1 + P,(T, A) 
(TD;)” (fe r3)- toast 


where P,,(T, A) is a polynomial in 1 of degree < m — 2, with coefficients in 
Z[T]. Hence 


AR? + Pya(l, A). 


caer Oe 


croy(e+5)| 


The numerator A*~? + P,_,(1, 4) is a polynomial in A of degree < p — 4. 
It is clearly not identically zero mod p, and so it has at most p — 4 roots mod p. 
We can therefore choose 4 # 1 in y,_, such that / is not a root of the 
polynomial mod p. This completes the proof. 


Theorem 3.1. Let x # 1, xo. We can choose 1 € ftp_, such that the element 
fo) = 6) 

generates U(y) over A, i.e., 
U(x) = A(x); 


and such an element is a free basis for U(x) over A. 


Proof. We know from Theorem 2.2 that 
U(x) = UI — 1)UG), 


and so by Lemma 1, e(y)-€ generates U(y) mod m,- U(y). By Nakayama’s 
lemma, it follows that €(y) generates U(y) over A. Since U(y) ~ A by Theorem 
2.1, such a generator is also a basis, thereby proving the theorem. 


§4. The Coates-Wiles Homomorphism 


In this section we give the extension of the Kummer homomorphism to all 
levels, based on a refinement of the associated power series. 
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Theorem 4.1. To every element ue U there is a unique power series f, in 
Z,LLX]] such that 


Sun) = Un- 
This power series satisfies f,(X) = 1 mod (p, X), and the map 
ut>f, 


is a homomorphism of U into the multiplicative group of power series 
=1 mod (p, X). 


We first note that uniqueness is obvious since a power series has only a 
finite number of zeros (Weierstrass preparation theorem). 

The proof of existence will proceed via several steps, which also develop 
systematically other properties of these series. First: 


1 
CW 0. ROY =e X) 
is the power series associated with our element &. Indeed, 


An) = yp © — 4) = be 


Next we note two formal properties of the power series f, which is called the 
associated power series to u. 


CW 1. If ae Z, then the power series associated with u® is f,(X)*. 


Proof. This is first obvious when a is a positive integer, and is then true 
for all a € Z, by continuity. 


CW 2. If f, is associated with u, and o € Go, then there is a power series 
associated with u’, namely 


ful X) = ACL + XY — 1). 
Proof. If u, = > a,x, with a, € Z,, then 
un = > a(fz — 1, 
so the property is obvious from the definitions. 


We are now ready to prove the theorem, i.e., we must show that every u 
has an associated power series. The two properties CW 1 and CW 2 show 
that the set of elements in U having an associated power series is a A[Go]- 
submodule of U, and contains €. So it contains A[G)]é. In particular, taking 
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Theorem 3.1 into account, we have already shown that it contains U(y) for 
all y # 1, %. This is enough for the applications we have in mind. 

For the remaining eigenspaces of vy = | or x, the element & does not 
suffice to generate these spaces, and one must show how to find an associated 
power series for additional elements generating these spaces. This is not too 
difficult and will be left to the reader, especially since a generalization of the 
associated power series to all Lubin—Tate formal groups has been given by 
Coleman [Col]. 

Let as usual 


D => ad + X)Dx = Dz. 
Since f,’/f, has coefficients in Z,, it is clear that 


Dfulf(X) € ZlLX]- 


For each integer k > 1 we define the Coates-Wiles homomorphism ¢, on 
U by 


px(u) = D* log f,(0) = D*~*(Df./f.)(0). 
We see that g,,: U-—> Z, maps U into Z, by the preceding remark. 


CW 3. Foro eGo, ou’) = x(c)*¢,(u). 


Proof. If ut-> u’ then CW 2 gives the power series associated with u’, and 
the assertion is then obvious by the chain rule applied to 


D* log f,2(0) = Dé log f.(e%? — Diz-o- 
CW 4. Let y = x6 where a is a residue class mod p — 1. 
(i) Ifk = «mod p — 1, then p,(u(y)) = 9x). 


(ii) If k # a mod p — 1, then o,(u(y)) = 0. 


Proof. Let e(y) be the idempotent associated with xy. Then by CW 1 and 
CW 3 we find 


eu) = x5(e(X) GU). 
The property follows by orthogonality of characters. 
CW 5. For ue U and geA = Z,[[X]], we have 


o.(g-u) = g(x(y)* — le,(u). 


Proof. The assertion is true when g(X) = 1, and when g(X) = 14+ X =y 
by CW 3. Thus it is true when g(X) = X, ie., 


Od X-u) = (x(y)* — Deu). 
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The property follows for arbitrary polynomials by induction, and arbitrary g 
by continuity. 


Theorem 4.2. Given a congruence class « mod p — |, there exists a power 
series h, such that for any k = «mod p — 1, we have 


(1 — p¥~*)@x(6) = halxe(y)* — 1). 


If « is even 4 O mod p — 1, then we can choose A such that 


ee) 
is a unit, and h, is a unit in Z,[[X]]. 


Proof. Let 


A(X) = D log f(X) = 1 + XFUAX). 
Then by Meas 6 of Chapter 4, 


gx(€) = D¥-4f,(0) = if x1 dy (2). 


Then a computation shows that 


(1 = pox) = DHF) = [a don (a) 


z3 
= | a* dy(a) 
Zp 


for some measure yp. By decomposing the integral over cosets of ,_, in Z* 
we can write 


[., e@ray* dua) = 


yf  aduta 


TELp—1 Y1+PZp 


where py, is a measure with support in 1 + pZ,. By Example 2 of Chapter 4, 
§1, we conclude that for each r there is a power series f, such that 


Me __ # dala) = fo)? ~ 


and we let 


h= > Ff. 


TEUp-1 
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Then for k = a mod p — 1, we get 
Mey = 1) =| waar data) 
Zs 


= i a® du(a). 


* 
Pp 


This concludes the proof of the existence of 4. There remains to show that 
9, {€™) is a unit—it is then trivial that A is a unit power series. But this last 
property is clear from Lemma 1 of §3, as was to be shown. 


§5. The Closure of the Cyclotomic Units 
Let &, be the group of cyclotomic units, i.e., the group generated by 


W, = tu,+1 and elements 


¢-1 
where ¢ is primitive p"*+th root of unity. Let: 

V,, = closure of & A U, in U, = &r,1 

V = lim V,. 

<— 
If we wish to preserve the &-notation, then we may write 
V = lim &,, i Coa 

The group V, is a Z,[G]-module, and V is a A-module. Since &/W,, comes 
from the real subfield, y-eigenspaces occur only for even characters x of Go. 


Before analyzing the projective limit of V,, we recall in the p-adic context 
some facts about finite levels. In the global fields, we have an isomorphism 


é,/W, ~ Z(G Jo, 
where the index on the right indicates the augmentation ideal. Cf. Theorem 3.2 
of Chapter 6. Since the cyclotomic units are independent over Z, by the 
non-vanishing of the p-adic regulator, we obtain a G,-isomorphism 
6,/W, & ZylGz lo. 
Hence for each even x # 1, 
G./WMO) © ZolGt (x) = ZolGt lo e(y)- 
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Let c € Z* be a primitive root mod p?, and let 


f, — 1 
c—1 


VD, = oc) 


Then v, = 1 mod »,. Furthermore for any even character x # 1 the element 
v(x) = vg 


lies in &,, so in V,(y). (The root of unity w(c) disappears when we project on 
the eigenspace for x.) Note that the elements v, form a compatible system in 
the cyclotomic tower, that is 


NanUm = Un, form > n. 
We let 
v = lim »,. 
<_ 
Then 
v(y) = lim v,() = lim v8. 
Theorem 5.1. For each even character y # 1 we have 
VO = Zp[G.]}eon@), 
and hence 
V(x) = A-v(y). 
Proof. Immediate from Theorem 3.2 of Chapter 6. 


We note that the power series associated with the element v is 


AD = 00) Te F" 


Let y be an even character of Go, and y # 1. Let € = €&™ be the con- 
structed element of U such that &(y) is a basis for U(x) over A. We have 


U(x) = A-S(x) and V(x) = A-v). 


Let us write 
v(x) = 8,-E(x)_ with some g, EA. 
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Theorem 5.2. Let y = x6 be an even character #1. We have an isomorphism 


UM/V(X) = A/g,A. 


The power series g, (determined up to a unit in A) can be selected such that 
it is equal to the power series g satisfying 


gle)" — 1) = (1p) (1 - 8) EB, 


for any even positive integer k such that k = a mod p — 1. 


Proof. We have 


Gr) = 9(0(X)) by CW 4 
= gy(x(y)* — 1)e.(E(x) by CW5 
= gy(x(y)* — 1)@,(6) by CW 4 


= g(x(y)* — 1). — p*-7)73, 


where g = g,h,, and h, is the power series of Theorem 4.2. Since A, is a unit 
power series, g and g, generate the same Ae(x)-ideal. There remains to prove 


glo) = (I~ 8) 7 Be 


But 
: eX el aa) 
(+ HN L00 = - a 
SOE 8 i EEN 
~ e@— J e&—] 
= Dflf. 
So for k > 2, 


D¥~*(Df.lf.)(0) 


1 
(1 — ct)e By 


Qx(v) 


by the definition of Bernoulli numbers. This concludes the proof. 


The values of the power series g show that it is essentially the p-adic 
L-function. 


Remark. In this chapter we have proved a local statement which would be 
immediate if one had the Kummer—Vandiver conjecture, as explained in 
Chapter 6, §4. The corresponding global conjecture can also be formu- 
lated as follows. 
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Let K, = Q(u™), and let M, be the maximal p-abelian p-ramified ex- 
tension of K,,. Let x be any even character #1. Then there is a 
quasi-isomorphism 


Gal(M,/K.)(x) ~ 4/g,4 


where g, is as in Theorem 5.2. 
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Lubin—-Tate Theory 


This chapter reproduces with little change the approach to local class field 
theory given by Lubin-Tate [L-T]. Using special power series associated with 
prime elements in a p-adic field, they construct maximal abelian totally 
ramified extensions by means of torsion points on formal groups, thus 
obtaining a merging of class field theory and Kummer theory by means of 
these groups. 

The theory applies in particular to the cyclotomic case. The p"th torsion 
point on a suitable group will be seen to be the classical cyclotomic numbers 


€¢-1 


where € is a p"th root of unity. 


§1. Lubin-Tate Groups 


Let 0 be a ring. By a formal group over 0 we mean a power series 
F(X, Y)€o[[X, Y]] 
in two variables satisfying the three conditions: 
FG 1. F(X, Y) = X + Y (mod degree 2). 
FG 2. F(X, FCY,Z)) = FF(X, Y), Z). 
FG 3. F(X, Y) = F(Y, X). 


Strictly speaking, our formal groups should be called commutative one- 
parameter formal groups, but we won’t deal with any others. The expression 


190 


§1. Lubin-Tate Groups 


mod degree 2 means modulo the power series of degree >2. Using the 
associativity with Y = Z = 0 it follows at once that 


F(X, Y)= X + Ymod XY, 
ie., F(X, 0) = X and FO, Y) = Y. 
It is an easy matter to show recursively that given a formal group as above, 
there exists a unique power series A(X’) such that 
A(X) = — X (mod degree 2) 
and 
F(X, A(X)) = F(ACX), X) = 0. 


If this could not be proved, we would assume it as an axiom. We leave the 
proof as an exercise. For the more extensive foundations of formal groups in 
any number of variables, cf. Frohlich [Fr]. 


Example. The formal multiplicative group G,, is defined by 
F(X, Y)=X+Y+xXY=04+ X)d4+ Y)-1. 
If a is a positive integer, and [a] denotes “‘addition”’ on G,, a times, then 
[a(X) = 1 + Xx)? - 1. 


If M is an algebra over 0 (always assumed commutative) and M is nilpotent 
(in the sense that every element of M is nilpotent—some positive power of 
the element is 0) then the formal group F defines an additive group law on the 
set of elements of M, by the association 


(x, y) +> F(x, y) 
for x, yin M. Instead of F(x, y) we would also write 
F(x, y)=x+ry, or x[+]y, or x[+]ry. 


The set of elements of M with this group law could be denoted by M;. On 
the other hand, it is useful to use a slightly different notation. We view F as 
defining a functor 


Mt> My. 


We may also denote this functor by a letter like A (or A, if we wish to make 
the reference to F explicit), and then denote 
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to be the set of points of A in M. As a set, it consists of the elements of M, 
and it is also an additive group with the group law determined as above. 

Suppose that o is a complete valuation ring, with quotient field K and 
maximal ideal my. We also write 0 = ox. Then my = m is topologically 
nilpotent, in the sense that arbitrarily large powers of an element tend to 0. 
For any positive integer k, m/m* is a nilpotent o-algebra, and A(m/m*) is a 
group, as we saw. By continuity, it follows that A(m) is also a group. Addition 
between elements x, y in m is again given by 


(x, y) > F(x, y). 


Let L be any algebraic extension with valuation ring 0, and maximal ideal 
m,. Then we also have the completion 6, if L is infinite over K, with maximal 
ideal th,, and it is clear that A(wi,) can again be defined as group with the 
group law given by the same formula as above. 

By an endomorphism of the formal group F (or A;), we mean a power 
series f(X) such that 


SEXY) = FFX); f(Y). 


We say that fis defined over 0 if the coefficients of f lie in o. It is then clear 
that such an endomorphism defines an endomorphism of A(m) by the 
association 


xt>f(x), for x in m. 


Similarly, a homomorphism / of a formal group F into a formal group F’ is a 
power series such that 


SR(X, Y)) = F'G(X), f(Y)). 


This relation could also be written 
S(X +r Y) =f(X) +r f(Y). 
Such homomorphism induces a group homomorphism 
A(m) —> A‘(m), 


where A’(m) is the group whose underlying set is m, and whose group law is 
that determined by F’. 

We shall be interested in a special kind of formal group. From now on, 
we assume that 0, is a discrete valuation ring, and we let x be a prime 
element in my. We assume that 0,/my, is finite with g elements. We let: 


F, = set of power series fe o[[X]] such that 
F(X) = 1X mod degree 2 
F(X) = X%mod z. 
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Example. The power series (polynomial) f(X) = X? + 7X is an element 
of F,, actually its simplest element, which will be called the special or basic 
Frobenius polynomial associated with z. 

Example. Let 

(YO) = 4+ XP —- 1 = XP +--+ py. 
Then f(X) is an element of A. 


The elements of ¥, will be called the Frobenius power series determined 
by z. 


Theorem 1.1. To each Frobenius power series f in F, there exists a unique 
formal group F, (defined over 0) such that f is an endomorphism of F,. 


The formal group associated with f(Y) = X? + 2X in Theorem 1.1 will be 
called the special or basic Lubin—-Tate group associated with the prime z. 


The proof of this theorem will follow from a general lemma, as will the 
fact that the formal group F, then admits 0 in a natural way as a ring of 
endomorphisms commuting with f- 

Lemma. Let f and g be Frobenius power series in F,. Let 


LX, ry X,) = a,X, feet ar he an,Xn 


be a linear form with coefficients a; in 0. There exists a unique series 
F(X, ..., Xn) € o[ LX, ..., X,]] such that 


F(X, ..., Xn) = L(X1,..., X,) mod degree 2 
and 


SFY, ete ey X,)) _ F(g(X,), eons 1039. 2(X,)). 


Proof. We abbreviate X¥ = (X,,..., X,) and g(X) = (g(Xj),..., g(X,)). 
We show by induction on r that the congruences 


FX) = L(X) mod deg2 and f(F,(X)) = F,(g(X)) mod degr + 1 


have a solution F,(X) in o[X] which is unique mod degr + 1. This is true 
for r = 1 with F,(X) = L(X). Suppose it true for r > 1. We let 


Frat => F, ag Ay 41 
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where H,,, is a homogeneous polynomial of degree r + 1 with coefficients 
in 0. We have: 


f(Fra1(X)) = SFX) + 2H 41(X) mod deg r + 2 
F,+1(8(X)) = F(g(X)) + 2°*7*H,41(X) mod deg r + 2. 


To satisfy the desired relation up to degree r + 1, we must take 
A, 41(X) = AEC) = EAC) mod deg r + 2. 


The coefficients are in 0 because 
SE(X)) _ F,(g(X)) = (F(X)? as F(X% = 0 (mod Tt). 


It is then clear that 
F(X) = lim F,(X) € of [X]] 


is the desired unique solution satisfying the conditions of the lemma. 


Addendum to the lemma. The completeness of 0 was not assumed or used 
in the proof. Furthermore, the proof shows that F, is the only power Series 
with coefficients in an extension field of K satisfying the conditions of the 
lemma. 


Theorem 1.1 is immediate from the lemma. Indeed, F; is the unique power 
series F(X, Y) such that 


F(X, Y) = X + Ymod deg 2, 
and 


SR(X, Y)) = FY(X), f(Y))- 


The other two formal group properties are seen to be satisfied by showing 
that the left-hand side and right-hand side of FG 2 (resp. FG 3) are each the 
unique solution of a system of conditions as in the lemma. 

We call F, the Lubin-Tate formal group associated with f If we want to 
use the other notation, we also write it A(/), or simply A if the reference to 
J is clear from the context. 

We shall now see that F, admits o as a ring of endomorphisms in a natural 
way. We prove slightly more. For each pair of elements f, g¢ A, and aco, 
we let a;,, or [a];,, denote the unique solution of 


a;,g(X) = aX mod deg 2 
S°As.g = Ug °8- 
We write a, or [a], instead of a, , for simplicity. 
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Theorem 1.2. The association at a, is an injective ring homomorphism of 
9 into End(F;), such that 


ty =f 


More generally, the association at+ a, is an injective additive homo- 
morphism of 0 into Hom(F,, F;), satisfying the composition rule 


;,g° Bg, = [ab]yn 


and 


[a + b]y,(X) = Fy(@yo( X), by,6(X)) 


Q(X) +p, by,o(X). 


Proof. In each case, one checks immediately that both the left-hand side 
and right-hand side of the desired identity are solutions of the type given in 
the Lemma, whose solution is unique. 

It is clear that iff, g ¢ F, then the element 1, , is an isomorphism between 
F, and F;. Thus the isomorphism class of F; is uniquely determined by z. 

Furthermore, from Theorem 1.2, we may also view F; as an o-module via 
the operation a, for aév, and the above isomorphism is obviously an 
o-isomorphism. 

As a matter of notation, we shall use the three notations 


to denote the same power series. After a while, the polynomial f in F, 
becomes mostly irrelevant, and we think in terms of the group law A. Thus 
a, or [a] when A is fixed become more satisfying to work with. 

Let L be the completion of an algebraic extension of K. Then we may view 
A(m,) as an o-module in the obvious way. The operation of 0 on A(m,) is 
given by 


xt>a,(x) for xem;,. 


Of course, if L is finite over K, then L is equal to its own completion. By 
functoriality, we also see that the formal isomorphisms 1,,, induce isomor- 
phisms 


A(g)(m,) => Af (m;). 


In view of this isomorphism, it is often convenient to omit f or g from the 
notation and write [a](x) for the operation of 0 on A(m,) for x in my. 

Let L be a Galois extension, with Galois group G over K. The operations 
of elements of G on L extend to the completion by continuity, so we may 
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replace L by its completion. Since the power series a; .,, a;,;, F; have coeffi- 
cients in 0, it is then clear that the operations which they define on m, 
commute with the action of G on m,. 


§2. Formal p-adic Multiplication 


Again we let 0, be a discrete valuation ring with quotient field K, which we 
assume complete. We let m, be the maximal ideal, and let g = card 0,/mx 
be finite, a power of the prime p. We let f be a Frobenius power series over 0, 
associated with the prime element z in mx, and we let F; or A = A(f) be 
the corresponding Lubin—Tate group. 

For each aé 0, we let A,(/) be the set of elements x in the maximal ideal 
mya of the algebraic closure of K such that 


a,(x) = 0. 
In other words, A, is the kernel of [a]. If a is a unit, then Ker a, = 0, so we 
are really concerned with A,» for positive integers n. Of course, A, depends 
on f so we should write A,( f ). However, if g is another Frobenius power 


series in 0 associated with the same z, then the isomorphism 1,,, induces an 
isomorphism 


Aig) => Aa(f), 


which commutes with Galois isomorphisms. Further, if o is an automorphism 
of K*® over K, then 


a;(ox) = o(a,(x)), 
so A,(/) is a Galois module, and the extension 
K(A,(/)) over K 
is independent of the choice of fin F,. We shall see in a moment that it is a 
separable extension, whence it is a Galois extension, and is finite for a 4 0 
because a non-zero power series has only a finite number of zeros (Weierstrass 
preparation, or more naively, use the power series X? + 2X in F,). 
Consider the case n = 1, so consider K(A,). Then 
K(A,) = K(x) 
where x is a root of X7 + nX = 0, x ¥ 0, or in other words, x is a root of 
Xt14n7=0. 
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Thus K(A,) is a Kummer extension (since the (q — 1)th roots of unity are 
in K), with abelian Galois group, cyclic of order g — 1, and totally ramified 
over 7. 

Let xe A, and x # 0. The map 


at> a,(x) 


gives a homomorphism of 0 into A,, whose kernel is obviously zo. Since 
A,, has q elements, it follows that 


A, & 0/n0 
as o-module. In particular, End, A, ¥ 0/x0, and 
Aut, A, % (0/n0)*. 
We have a representation 
%: Go = Gal(K(A,)/K) > Aut, A, & (0/n0)*. 


Since Go and (o0/z0)* have the same cardinality, namely g —- 1, it follows that 
this representation is an isomorphism. 
We have similar results in the 2"-tower. 


Theorem 2.1. (i) The group A,» is a free 1-dimensional module over o/n"o. 


(ii) K(A,") is abelian over K, totally ramified, and we have a natural 
isomorphism 


“: Gal(K(A,")/K) & (0/2"o)*. 


Proof. Let (x1, X2,..-.,; X,) be a sequence with x, ¢ A,*, such that x, 4 0 
and 2;(x,) = X,~1. Without loss of generality we may assume 


F(X) = X97 4+ 1X. 
For k > 1 we see that x, is a root of 
XY + 0X — Xp4 = 0. 


Relatively to the field K(A ,»-1) this is an Eisenstein equation, and so we have 
shown inductively that K(A,») is totally ramified. Since A,» is stable under 
the Galois action, and since the equation 


X74+ 2X — x,_1 = 0 


is separable, it follows that K(A,")/K is Galois. As before, we get a repre- 
sentation of the Galois group in Aut, A,». The map 


at>a,(x,) 
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induces an injection of o/x"o into A,», whence an 0-isomorphism by counting 
cardinalities, and it follows as for n = 1 that we have an isomorphism as in 
ii), thus proving the theorem. 


Passing to the limit, we may form the projective limit 7,(A), consisting of 
all infinite vectors 


(Xo; X1,-. ) 


such that 2,(x,) = X,-, and 2,(xo) = 0. It is then immediate that 7,(A) is 
a free 1-dimensional module over o. 
Let 


K, = K(A), Ko = (J Ky 
Then K., is an abelian, totally ramified extension of K, and 
x: Gal(K../K) = 0* 


in the natural way. If uv is a unit in o*, then we have a corresponding element 
of the Galois group, denoted by o,, which is such that 


Oo, = [u],~* 


in the representation on T,(A(/)). If we wish to omit the reference to f, we 
simply write [uv]. Thus on a vector as above, we have 


Gy “(Xo X15 ---) = ([u](%o), [u](), - - -)- 


It is also convenient to have a notation for the representation of the Galois 
group in o*. We let 


“: Gy —> o* 


be this representation, where Gx, = Gal(K*/K), such that 


ox = [x(a)],(x) for xe A™. 


Example. We shall now give the standard example with the formal 
multiplicative group 


F(X, Y)=X+ V+ XY. 
Over the p-adic integers 0 = Z, we have the Frobenius series given by 
f(X) = (1 + XP — 1 = XP +--+ + pX 
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associated with the prime p. Let A be the corresponding Lubin—Tate formal 
group. Then in fact A is defined by the power series 


F(X, Y) = F(X, Y), 


i.e., A is the formal multiplicative group. Then A,» consists of those elements 
in the maximal ideal of the algebraic closure satisfying the equation 


(d+ xy" -—1=0 
and these elements are none other than 
Cooks 
where ¢ is a-p"th root of unity. 


Theorem 2.2. The prime x is a norm from every extension K(A,"). 


Proof. Consider first the bottom level of the tower K(A,) over K, obtained 
from the equation 


X14 n= 0. 
Let « be a root. Then 
(—1)*-14N(@) = 2. 


If g is odd then z is the norm of «. If q is even then g — 1 is odd, the degree 
[K(A,,) : K] is odd, and —1 itself is a norm. Hence z is the norm of —«. 
This proves the theorem in case n = 1. For the proof in general, let 7 = xo, 
and let 


(Xo, X1;---) 
be such that 
Te s(Xn) = Xn-1- 


Thus x, is an element of A,, x; # 0, and x,_, is a norm of +x, from the 
field K(x,) over K(x,-1). The argument is similar and equally trivial, as 
desired. 


Theorem 2.3. Let B be the special Lubin-Tate group associated with the 
prime x and the Frobenius polynomial X* + 1X. Let € € ug_1. Then: 


@ [c(X) = x. 
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(ii) If F(X, Y) is the group law for B, and 
F(X, Y)=X+ ¥+ > ayX'Y!, 
then a, = 0 unless i + j = 1 (mod p — 1). 
Proof. Let x, be a generator of B,n+: such that 
[x](%n) = Xn-1- 


Since xX, is a root of X2-1 + 2 = 0 it follows by a trivial recursion that the 
irreducible polynomial for x, over K is a polynomial in X%~1. Therefore we 
can find an automorphism 


o, € Gal(K,/K) 


such that o,x, = ¢x,. Since elements of the Galois group commute with [z], 
there exists an element o € Gal(K.,/K) such that 


ox, = Cx, for allan. 


By Theorem 2.1 there exists a € 0% such that ox, = [a](x,) for all n. Since 
o7-1 = |, it follows that a is a (q — 1)th root of unity. But also 


Cx, = [a]x, = ax, mod x? 
so € — a= 0mod x,. This is impossible since both ¢, a are in py_;, unless 
¢ = a, thereby proving (i). 
Secondly, for every € € y,-1 we have 
CF(X, Y) = FCX,CY) = EX + CY + D> ayglit Xt? 
Then (ii) follows immediately. 


§3. Changing the Prime 


We shall now analyze what happens when going from one prime z to another 
prime x’ = zu where u is a unit of og. Since we have to refer to the primes, 
we let 


K® = K® = |_) K(4,»). 
We let A’ be the Lubin-Tate group associated with x’, so 
KM = KJ K(A,). 


We let L be the completion of the maximal unramified extension of K, with 
ring of integers 0,. We let K,, be the maximal unramified extension of K, 
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and o,, its valuation ring, with maximal ideal m,,. We let @ be the Frobenius 
automorphism of K,,, extended by continuity to L. 


Theorem 3.1. Let f, f’ be Frobenius power series over 0, associated with the 
primes n,m’ respectively. Let ¢ be a unit of 0, such that e®/e = u. (Such 
units exist.) Then there exists a unique isomorphism 


0: F, fo F f’ 
defined over 0, which commutes with the operation of 0, that is for alla ino. 


Oca, = ay 0 8,” 
and such that 
6(X) = eX mod deg 2. 
This power series @ satisfies 
0? = Bou, 


Proof. The existence of the unit ¢ such that e°/e = u is easily obtained by a 
recursive procedure, and is left to the reader. We then construct a power series 
O(X) = eX +--- to satisfy 6° = 60 u, as follows. Let 0,(X) = eX. Suppose 
we have found 6,(X) of degree r such that 


6?(X) = 0,(u,(X)) mod degr + 1. 
We wish to find some element b € 0, such that the series 
O,41(X) = 0X) + bX 

satisfies the same congruence to one higher degree. We have: 

0943(X) = OP(X) + bexXrtt 

0,41 °Uf(X) = 0,0u,(X) + bu(X)tt. 
The condition on 3 is therefore that 
b° — butt =e, 


where c is the (r + 1)th coefficient of 0, 0 u,(X) — 0°(X). Write b = ae™t?. 
The condition on b is equivalent with the condition 


a? — a = cert, 
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A recursive procedure, letting a,,, = a, + x2"*! shows that we can solve 
for x at each step to make the equation valid mod 2”, whence solve the 
equation for a in o,. This concludes the construction of 0. 

We shall now see that 6 almost satisfies the other conditions of the theorem, 
and that it is easy to adjust it to get these other conditions exactly. Let 


g = 0on,' 0673. 


It is obvious that 0 o F, 0 8-1 commutes with 0 o 2,’ 0 871 = g. We contend 
that g is a Frobenius power series associated with x’, and has coefficients in 
0 = 0x. Once we have proved this contention, we then conclude that the 
power series 


OF (0 *(X), 8 *(¥)) 


has the properties characterizing F,(X, Y) (it is obvious that this power series 
is congruent to X + Ymod deg 2). The Lemma and addendum to Theorem 1.1 
show that the power series is equal to F,(X, Y). Similarly, we verify that 
6 0 a,o @~* has the properties which characterize a,, so is equal to a,. In this 
manner, we have proved the theorem except for the fact that 


0:F; > F, 
is an isomorphism from F;, to F,. Replacing @ by 1,-,, ° @ then concludes the 
proof. 
All that remains to be done is settle the contention. We have: 
On, O-1(X) = en'e 1X = n'X mod deg 2. 
Also, 


On,'0-(X) = Oupn,O-*(X) = O°(f(0-*(X))) 
= 6°(0-1(X)%) mod 2 
= 6°(0-%(X4)) mod x 


= X! mod z. 


There remains only to prove that the coefficients of 67;0~+ lie in 0, and it 
suffices to show that they are fixed under the Frobenius automorphism 9. 
We have: 

(6n;0-1)? = 0°fu,0-° = 0°f0-1 = Ou, fO-* = On;0-?, 


which concludes the proof of the theorem. 
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§4. The Reciprocity Law 


Let K,, as before be the maximal unramified abelian extension of K. Local 
class field theory would immediately show that the composite field 


K OR 


is the maximal abelian extension of K. On K™ we have a good model for the 
Galois group given by the association 


at> [a], aeo*, 


and on K,, we have the Frobenius automorphism, which generates the 
Galois group. We wish now to give an independent proof that the field KK, 
is independent of the choice of z, and that the structure of the Galois group 
is in fact determined independently of that choice also. 


Theorem 4.1. The field K™K,, is independent of n. Let ac K*. Write 
a= un™ 


for some unit u, and some integer m. Let r,(a) be the automorphism of 
KK, such that: 


r,(a) = o,0n K” 


r,(a) = 9" (y = Frobenius) on Kr. 


Then the association at-> r,(a) is independent of the choice of x. 


Proof. Let L be the completion of K,, as in the preceding section. Let A 
be the Lubin-Tate formal groups associated with the prime z, and let A’ 
be associated with the prime x’. Since A and A’ are isomorphic over L by 
Theorem 3.1, it is clear that 


LK® = LK®, 
However, K,, is algebraically closed in L. The two totally ramified extensions 
K™K,, and K®°K,, 


become equal when lifted to L. By elementary field theory, they must be 
equal as extensions of K,,, thus proving the first assertion in the theorem. 

The set of prime elements x’ generates the multiplicative group K*. To 
prove the independence of r,(a) from the choice of z, it will therefore suffice 
to prove that for all x’, 


r,(t’) = ry’). 
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These automorphisms coincide on K,, since they both give rise to the 
Frobenius element. It will therefore suffice to prove that they coincide on 
K™. We keep the notation of Theorem 3.1. Write x’ = ux with some unit wu. 
Since r,-(x’) = identity on K“”, we are reduced to showing this same property 
for r,(z’). 

Let f be a Frobenius power series associated with x. The field K®™ is 
generated by the elements 6(x) with xe Af). Hence we are reduced to 
showing that such elements are fixed by r,(z’). Indeed: 


r(™')O(x) = r,(u)r.(7)0(x) 
r,(u)O7(x) 
O°(u;~*(x)) 

= (x). 


ll 


This concludes the proof of the theorem. 


One may now use local class field theory to guarantee that K™X,, is the 
maximal abelian extension of K. Let 


(a, Kap/ K ) 


be the norm residue symbol mapping K* into Gal(K,,/K) from local class 
field theory. Then we find 


r(a) = (4, Kay/K). 


Indeed, both automorphisms induce the Frobenius automorphism on K,,, 
and for any prime element z, both automorphisms induce the identity on 
K™, since z is a norm from every finite subextension of K™ by Theorem 2.2. 
Since r(a) and (a, K,,/K) coincide on all prime elements, they coincide on 
K*. 


§5. The Kummer Pairing 


It should be clear that the formalism of formal groups is completely analogous 
to the classical formalism on the multiplicative group or the group of Witt 
vectors. In a similar way, one can develop ‘“‘Kummer theory”’ completely 
analogously to the standard way (cf. for instance Algebra, Chapter VIII, §8), 
or the way Witt did it in characteristic p for his vectors (Crelle 1935-1936). 
The possibility of doing this in the context of Lubin—Tate groups was first 
noted by Frohlich [Fr]. Of course, some new phenomena arise. Applications 
to explicit reciprocity laws as in Coates—Wiles [C-W 2] and Wiles [Wi] will 
be postponed to a later chapter. 
Let A be a Lubin—Tate group associated with the prime 7. We let 


K, = K(A nt 1) 
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so Ky = K(A,). We let 0, be the ring of integral elements in K, and let p, 
be its maximal ideal. We write A(p,) as usual for the set of elements », with 
the group law defined by A. We define a pairing 


A(p,) x Kn > A,n+. 


Let x € A(p,), let ¢ be an element of A(p..) such that [x"*+?]t = x, and let 
a € K*. Note that actually t € A(p?"*+). Define the symbol 


«x, On = Ot —a4l, 
where o, = (a, K2°/K,) is the automorphism of K2° over K, arising from 
local class field theory. Then it is clear that <x, «> lies in A,»+1 and is inde- 
pendent of the choice of tf such that [x"*1]t = x. We call it the local (Kummer) 


symbol (relative to the formal group A and the multiplicative group). If we 
want to specify A in the notation we write 


Ki; Oe 


Example. The formal multiplicative group. For 8 = 1 mod p, and «e K* 
we define the classical norm residue symbol 


(B, Dn = <B— loan + I, 

where A is the formal multiplicative group. 

The local symbol trivially satisfies the following properties. 

LS 1. It is ox-linear in x and multiplicative in a. 
In particular, the symbol induces a pairing 

A(pn)/[a"**]A (Pn) x KR/KE"* > Ago, 
and it is clear that in the pairing 
A(p,) x K* > Ape 

the kernel on the left is exactly [x"*+]A(p,), because if x is not a [x"*1]- 
multiple in A(p,), then its [x”*1] root t generates a proper extension of K,, 


so the Galois group operates non-trivially. 


LS 2. If 8: A — A’ is an isomorphism over 0 between two Lubin-Tate groups 
associated with the same prime n, then 


<x, aya = O(KO-*(x), aa). 
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LS 3. If o is an automorphism of K*” over K then 
XOX, OM)>n = OXX, Dn. 
LS 4. <x, >, = 0 if and only if « is a norm from K,(t), where [x"**]t = x. 


This last property uses a fact from local class field theory which could be 
proved from the Lubin—Tate formalism, but which we shall take for granted. 
Otherwise, the other properties are obvious. 


As an application of LS 4, let N,,,, denote the norm from K,, to K, for 
m > n. Then the orthogonal complement of A,»+1 under the pairing is given 
by 


1 * 
A,ati = Non+1,nKen41- 


LS 5. Let m = n and let Nmn,, denote the norm from K,, to K,. Then for 
xe A(p,) and ae K* we have 


[x™-"]<x, > m = <x, Nmnn(O)>n- 
In other words, Nw.» is the transpose of [x"~"]. 


Again this is clear from the functorial properties of the norm residue symbol 
which we assume. We can then define the symbol in a limit situation as 
follows. We let 


T(Kz) = group of sequences (%, a,...) with a, € K* such that for 
alln = 0, 


Nn+i,n%n+1 = Oy. 


Thus T(K%) is just the projective limit of the groups KZ under the norm 
mappings. We may then define a pairing 


A(Po) X T(KS) > T,(A) 
by letting 
xx (Xo, O1,-- a > (<x, Oo>os «x, %1>15 ae .). 


On the right-hand side, the components <x, %m>m are defined for all m 
sufficiently large, i.e., m > n such that x € A(p,). The components <x, a>, 
for k < n may then be defined by applying the appropriate power of [z] to 
the nth component. Property LS 5 shows that this is well defined. The pairing 
is ox-linear in x and multiplicative in « e¢ T(K%). 
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Of course it is a considerable restriction on an element a, in K* to be 
liftable to an infinite vector of consecutive norms. In fact, let N, be the norm 
from K,, to K, and let 


Nz 1(n®) = group of elements « € K* whose norm N,« is a power of x. 
Lemma 1. We have Nx *(x®) = (.) NnunKit. 
moan 
Proof. Suppose N,« € x”. Then 
(a, Ky,/Kn) =y (Nim, K,,/K) € (x7, K,,/K) =1 

because z is a norm from each extension K, by Theorem 2.2. Hence « is a 
norm from K,,, thus proving one inclusion. 

Conversely, suppose that « is a norm from each K,, for m > n. Then 


1 = (@, Kn/Kn) = (Nn%, Kn/K). 


Let N,« = m’u where u is a unit in K. Since zx is a norm from K,, by 
Theorem 2.2, we conclude that 


(u, K,/K) = 1 for all m. 


Hence u = 1 mod p" for all m, so u = 1. This proves the reverse inclusion, 
and proves the lemma. 


LS 6. Assume that [x](X) is a polynomial. If p is odd then 


(i) <a, &>, = 0. 


Whether p is odd or even, we have 


(ii) <a, —&>_ = 0, 

(iii) «x, -lD,=0 ifford,x>e+t+1 

and so 

(iv) K4,0,=0 iford,a>e+]. 
Proof. Let 


[n"**\(X) = F(X). 
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Let ¢ be a root of f,(X) = a. The extension K,(Z) is independent of the choice 
of t. Thus, if we factor f,(X) — a into irreducible polynomials over K,,, say 


fO=8= I frdXs 


then for each j we obtain K,(t) by adjoining a single root of f,,,(X) to Ky. 
Therefore, if c,,; denotes the constant term of f, (X), and d is the common 
degree of the irreducible polynomials f,,, then we conclude that (—1)*c,, 
is a norm. But 


s 
-“%= | | Cay 
g=1 


Hence —a = (—1)* times a norm, and (—1)** = (—1)"*? is a norm. This 
proves the first two assertions. For the last two (relevant only for p = 2), 
it is clear from bilinearity that 

2x, -—Ib = 0. 


If ord, a > e+ 1, then « = [2]y for some y, and so <a, —1> = 0, thus 
proving the last two assertions. 


The following lemma gives information on the factor group 
A(p,)/[n"**]A(Pp), 


by showing that near the origin, the operator [z] operates very regularly. We 
let xz, denote a prime element in K,. 


Lemma 2. Assume k > gq". Then A(nnko,) = [n]A(n*o,). 


Proof. The inclusion > is obvious. We prove the reverse inclusion. Let 
z = nn*t with te o,. We must solve 


xi'+nax=z with x=7*y and yeo,. 
This is equivalent to 
maryt 4 qPa-Vtky = agke, 
But k > gq" implies that gk > q"(q — 1) + k, so we are reduced to solving 
fy =a +y-t=0 


with a divisible by z,. Since f(t) = 0 mod z,, and f’(t) = 1, Hensel’s lemma 
does it. 
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As consequences of the lemma, we find for instance: 
A(n**?0,) < [x"*JA(p,) 
Ane ***t0,) < [n]A(p,). 
This second inclusion comes from using k = q" + 1. 


Theorem 5.1. Let w, be elements of », for i= 1,...,q"*1, such that 
ordy, w; = i. Then these elements generate A(p,) mod []A(p,), and therefore 
generate A(p,) mod[x"*+]A(p,) over o. 


Proof. Since X +4 Y= X + Ymoddeg2, given xe, we can find 
a, € o such that 


x —4 [a:]w, = 0 mod pf, 


because [a,]w, = a,w, mod p2. We may then proceed recursively to find 
dz,..., Agr+1 Such that 


x = [ay]wi +4 [ae]we +4° ++ +4 [Gqnt1]wan+t mod pees 


By the lemma, this congruence also holds mod[z]A(p,). Hence the w; generate 
A(p,) mod[z]A(p,), whence by Nakayama’s lemma, they also generate 
A(p,) mod[x"**]A(p,). This proves the theorem. 


The special case when K = Q, is of importance in the cyclotomic theory 
(and elsewhere), and some refined statements can be given as in the next two 
theorems due to Coates—Wiles [C-W 1], [C—W 2]. 


Theorem 5.2. Assume K = Q,. Then A(p,)/[x"**]A(p,) is free over o/n"* 10. 
Suppose that A is the basic Lubin-Tate group. Let I be the set of integers i 


Satisfying 
1<i<p"*! with (i,p)=1, or i= p"*}. 
Let xo be a non-zero element of A,, and let 
(X63. X58 Kass) 
be an element of T,(A), that is [x]x,.1 = X,. Then the elements 
{xi} withiel 


form a basis. 


Proof. For ie I we let w, = x‘. On the other hand, if p’ divides i exactly, 
we take 


wi = [n"](x1?’). 
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This shows that the elements x}, with ie J generate A(p,) mod[x"*1]A(p,), 
over 0. 

There remains to prove that they are linearly independent mod 2"+1o. 
We first show that they are linearly independent in A(p,)/[x]A(p,) over 0/0. 
Suppose we have a relation 


(*) >, [ailx, = 0 mod [r]A(p,), 
where >, indicates the sum with respect to the group law of A, and some 


coefficient a; is a unit, say a,. We may assume a, = 1. 


Case 1. k < p"*?!. For any x € », we know that 
[z]x = x? +--++ px. 


Either the term x? dominates this expression, in which case ord», x is divisible 
by p, or some other term dominates, which means that 


p-ord x > p"(p — 1) + ord x, 


so ord x > p", and ord [z]x > p"*}. This implies that we cannot have a 
relation of congruence (*) because as with ordinary addition, if y, y’ € A(p,) 
and ord y ¥ ord y’ then 


ord(y +4 y’) = min(ord y, ord y’). 


Hence there cannot be any cancellation in the sum of the left-hand side of 
(*), thus concluding the proof in Case 1. 


Case 2. k = p"*1. Then by Case 1 we may suppose that a, is divisible by 
n for all i # k, and therefore x?"*" lies in [z]A(p,). We use the hypothesis 
that A is the basic Lubin-Tate group, and then there exists y € p, such that 


y+ ny = xt 
But x8" ~ x) and x87! = —z. The above equation is clearly impossible if y 


is not divisible by x) because the orders on the left-hand side cannot match 
the order on the right-hand side. Then we divide by x,? to find 


(y/X0)” — (y/%0) = x8"**/x8 = unit. 
Reading this equation mod py, yields a solution of 
Y? — Y= unit mod p, 
in the residue class field Z/pZ, which is impossible. This proves the theorem. 
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Theorem 5.3. Assume that K = Q,, and that K(A,) does not contain the 
pth roots of unity. Then the local pairing 


A(pn)/[a"* JA(pn) x KA/K#P"** > Agno 


is exact on both sides, i.e., the kernels are 0 on both sides. 


Proof. In Theorem 5.2 we have determined the order of 


A(p,)/[x"**]A(P2). 


It is a standard exercise of local algebraic number theory [L 1], Chapter II, 
§3 to determine that 


order of K>*/K*?"** = (A(p,): [n"*"]A(p,))p" 


where p’ is the order of the group of p-power roots of unity in K,. If K(A,) 
does not contain py, then neither does K,. Hence 


Kine 


has the same order as A(p,)/[z"*1]A(p,), and we know that the kernel on the 
left is trivial. Since A,»+1 is cyclic in the present case, it follows by the duality 
of finite abelian groups that the kernel on the right of the pairing must also 
be trivial, as desired. 


Remark. When the pth roots of unity are in K(A,), in particular when 
A = G,,, the above argument definitely shows that the kernel on the right is 
non-zero. 


§6. The Logarithm 


Let A be a formal group, defined by a power series F(X, Y) over some ring 0 
with quotient field K of characteristic 0. It can be shown that there exists an 
isomorphism 


A:A>G, 


with the additive group, i.e., a power series with coefficients in some extension 
of K such that 


MX +p Y) = A(X) + ACY), 
A(X) = X mod deg 2. 
The + sign on the right-hand side is the ordinary addition. That power series 


is then uniquely determined, and its coefficients lie in K. It is called the 
logarithm on A, and we write A = A, if we need to refer to A explicitly. 
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Example. Suppose A = G,, is the formal multiplicative group. Then the 
log is given by 


A(X) = log(1 + X), 
where the log here is the usual series from calculus. 


It is easy to show that any endomorphism of G, is given by multiplication 
with a scalar, i.e., if a power series / satisfies 


A(X + Y) = A(X) + ACY), 


then A(X) = aX for some constant a. Hence the uniqueness of the log A, 
follows at once. In this section we shall prove its existence for Lubin-Tate 
groups, and additional properties, following Wiles [W] in preparation for the 
explicit reciprocity laws. 


Lemma 1. The limit 

Ayaan 5 n(X) 
exists, and gives a formal isomorphism of the Lubin-Tate formal group A 
with the additive group Gy. 


Remark. The limit is to be understood in the following sense. Each term 
1 n (n) Vk 
1 ma(X) = >: Ck X 


is a power series. By the existence of the limit, we mean that for each k, 


lim ch = c, 
n 


exists as n > oo, and then 1(X) is defined to be > c,X*. The convergence will 
not be uniform in k. 


Proof of the lemma. We look at the difference 


1 1 
qtr ma t"(X) = a ma(X) = qe (x, ° ma(X) = mn(X)). 
mA(X) = n"X + g,(X) 
where 
g(X) = > aPuix? and i+j2zn+1, jz 2, 
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and the coefficients a{? are in ox. Then the right-hand side of the required 
difference is equal to 


1 
nett a(n" X + 8n(X)). 


We are interested in the coefficients of monomials of degree <k for a fixed k. 
Reading all expressions mod X**+ we see that we may assume 


i=>n+1—(kK+4+1). 
Hence x"X + g,(X) is divisible by x"~*. Similarly, the power series expression 


for g, is divisible at least by 2™~*. Since g, has degree =2 it follows that 
g(n"X + g,(X)) is divisible at least by 


n - ky 2(n -k) 2” +r- 3k 


Dividing by x"*”’ shows that the required difference tends to 0 as n> oo. 
This proves that the desired limit exists. 


It is clear that A(X) = X mod deg 2. 
There remains to prove that / satisfies the homomorphism property. We 
have: 


1 1 
qn MAX +a Y) = = (WAX +4 M4) 
1 n n 1 n ia i 
= 4 (4X + MY+G > cyma(X)'0A( YY, 
qT 1 i+j722 
where c,; are the coefficients of the formal group 


X +4 Y= > eyX'¥!, 


For each fixed k, m the coefficient of X* Y” in the sum on the right-hand side 
tends to 0 as n tends to infinity, so the additivity follows. 


Lemma 2. The log 4, commutes with the action of 0, that is, 
Asag(X)) = ad(X) for ae og. 


For the basic Lubin-Tate group B, if 
A(X) = X+ > a,x! 
1=2 


then a, = 0 unless i = 1 modgq — 1. 
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Proof. The function X+> A,(a,(X)) is an additive formal power series. 
such that 
As(a4(X)) = aX mod deg 2. 
The uniqueness of the logarithm shows that this function is aA,. 
For the basic Lubin-Tate group, we take a = £ where ¢ is a primitive 


(q — 1)th root of unity, and apply Theorem 2.3 to conclude the proof. 


Lemma 3. (i) Let 4’ denote the formal derivative di/dX. Then 4',(X) has 
coefficients in 0. 


(ii) The series 14(X) can be written in the form 


1X) = > eX) At 


where g(X) € o[[X]]. In particular, it converges on the maximal ideal. 
(iii) Suppose g = 3 and let x € K* have ord, x > 1. Then 


Aa(x) = x mod x?. 
Proof. For (i) we differentiate with respect to Y the relation 
AMF(X, Y)) = AKX) + ACY) 
and get 
MAF(X, Y))D2F(X, Y) = 24(Y). 
We then put Y = 0, and find 
MAX) DoF(X, 0) = 1. 
But from F(X, Y) = X + Ymod deg 2, it follows that D,F(X, 0) is a power 
series whose constant term is 1, and with coefficients in 0. This proves the 
first assertion. 


As for (ii), it suffices to prove the result for the basic Lubin-Tate group 
whose Frobenius power series is given by 


X7+ 1X, 


because if yy: A — Bis an isomorphism such that y(X) = X mod deg 2, then 
Ag = Agow. 
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It also clearly suffices to prove the following statement: 
The power series [x"|(X) lies in the module 

o[[X]]n/x" 
with j => 0, k = 1 andj + [log, k] = n. 


We prove this by induction. It is obvious for n = 1. Assume it for n. Let 


[n"\(X) = fi(X) = D Bn X)0IX*. 
Then 
[x"*21(X) = fa(X)* + mf,(X). 


It is immediate that 2f,(X) satisfies the induction hypothesis with respect to 
n + 1. For the term f,(X)‘, it will consist of cross terms which binomial-type 
coefficients divisible by p, hence by 7, thus satisfying the desired conditions 
on the exponents, or terms 


Sil X 7a", 


The log, of the exponent of X is increased by one, and so the desired in- 
equality is also satisfied. This proves (ii). 
Part (iii) is obvious from (ii). 


Observe that in the simplest case of the ordinary log, 


2 
log + X= X- A+... 


If p = 2, the first term after X gives trouble. If p = 3, the next term which 
might give trouble is 
x3 


cera | 


3 


but in this case, the assumption ord, x = 1 shows that (iii) holds. After that, 
things only get better. 


Lemma 4. Let e,(Z) be the power series (with coefficients in K) which is 
the inverse of A4(X). Let D be the disc in mx consisting of those elements z 
such that 


ord, z > : 
q- 
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Then e,(Z) converges on this disc, and induces the inverse isomorphism to 
Ag, on the groups 
4A 
A(D) = G,(D). 
A 


For z in this disc we have 
ord z = ord A,(z) = ord e,(z). 
Proof. Let ye D, y # 0. Define 
1 
A(X) = 5 Ad yX). 


Then for i > 0, 


gal 
q-1 


1 
ord = (q'—l)ordy—i> 


xt 


—i. 
By Lemma 3, it follows that A,(X) has integral coefficients. Let E, be the 
power series such that E, oA,(X) = X. Replacing X with y~'X we see that 


E,(Z) = y~*e,(yZ). Since E, has integral coefficients (because 1,(X) = 
X + higher terms), we conclude that 


1 
y e(yZ) 


has integral coefficients. Let e,(Z) = > a,Z". Then a, y"~+ is integral for all 
nand all y in D. It follows that in fact, a, y" tends to 0 (p-adically) as n tends 
to infinity for each y in D. Furthermore, we then conclude that 


€4(y) € YO«* 
for all y in D, and in particular, 
ord e,(y) = ord y. 


On the other hand, again using Lemma 4, it is immediate that for x in D, 
we have 


ord A,(x) = ord x. 
Since e, and A, give inverse mappings, we get 
ord e,(y) = ord y = ord A,(y), 


thus proving Lemma 4. 
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We may then recover immediately a lemma proved in connection with 
Theorem 5.1. 


Corollary. (i) A(x"*?0,) < [x"**]A(p,). 


(ii) AsA(Pn) > TOn. 
Proof. Clear. 


Lemma 5. The kernel of 1, in the maximal ideal of the algebraic closure of 
K is precisely Ajoy, the group of torsion points on A, or in other words, the 
group A™. 


Proof. A point x is a torsion point if and only if [x"]x is a torsion point 
for some positive integer n, or for every large positive integer n. But [x"]x 
approaches 0, and for large n, lies in the neighborhood of 0 where the 
exponential and log on A give inverse mappings. Since on the additive group, 
there are no elements of finite order, it follows that the kernel of 1, is precisely 
A™, 


§7. Application of the Logarithm to the Local Symbol 


We recall that the finite extension K, is self dual as a vector space over K 
by means of the trace. This means we have a non-degenerate K-linear pairing 


K, x K,2>K 
given by 
(x, y) +> T,(xy). 
Let a be an ideal in K,. We denote by a" the set of elements y € K, such that 
T, (xy) € 9 = Ox. 


Of course, we have the notion of perpendicularity with respect to any given 
pairing, and the context will always make clear which is meant. We have 


a+ = Hom,(a, 0). 
Indeed, let yy: a > 0 be a p-homomorphism. Then y can trivially be extended 
to a K-linear functional K, — K denoted by the same letter. But then for some 


a € K, we have 


W(x) = T,(x«) for all xe K,, 
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and «e€a+ by assumption. Our identification of a+ with Hom,(a, 0) then 
follows at once. If D, = Dx,)x is the different, then 


= q-1m-1 


We return to the pairing given by the local symbol 
A(pn)/[n"**]A(Pn) x Kai > A,nts. 


We had already noted as a consequence of LS 4 that 


4 pas * 
Azn+1 7 Non+1,nKone1- 


Observe that we are dealing with two orthogonality signs: One referring to 
the local symbol, and one referring to the duality 


K, x K,— K/o 


(where the trace is viewed as having values in the factor group K/o), applied 
in particular to an ideal 


a x atop, 
Then we have the pairing 
A(p,)/([n"**JA(p,) +4 A_%*1) X Atnar > Agnes. 


Since Ker 1, = Ator, we have Ker 1,0 K, = A,»+1. Applying the log map 
of A, we get a pairing 


AA(pq)/2"*AA(p,) X Adne1 > Agner. 


Let a = AA(p,), so the factor group on the left is a/x"*1a. In the light of the 
exact duality 


a/n"* 1q x at/n"* igi => o/n™* 1p, 
there exists a unique group homomorphism 
Wnt Agnes > AA(p,)*/0"**2A(Pq)* 


such that for x € A(p,) and « € Atn+1 we have 


LS 7. <x, 08 = [TrAa)n(@))4@n)- 
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This formula has been written without abbreviations, but of course in the 
future we frequently omit indices A, n, etc. If o ¢ Gal(K*/K) and o+> x(a) 
is its representation in o* on 7,(A), then 


LS 8. Walaa) = x(o)Wa(a)’. 
Proof. We have 


a<a-4x, a> = <x, oa> = [T, Asn (o4)) xn, 
and also 
a¢a-1x, a> = [T,((o~ As) Wa(@)) oxy). 


But ox, = [x(c)]x, by the definition of x, with x(c) € o*. Using [ab] = [a][b] 
and T,(ay) = T,(y) concludes the proof of LS 8. 
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Iwasawa [Iw 8] proved general explicit reciprocity laws extending the classical 
results of Artin—-Hasse, for applications to the study of units in cyclotomic 
fields. These were extended by Coates-Wiles [CW 1] and Wiles [Wi] to 
arbitrary Lubin-Tate groups. Although Wiles follows Iwasawa to a large 
extent, it turns out his proofs are simpler because of the formalism of the 
Lubin-Tate formal groups. We essentially reproduce his paper in the present 
chapter. 

We assume that K is a finite extension of Q, (i.e. has characteristic 0) 
because we want to use the logarithm. 

We allow p = 2, and I am indebted to R. Coleman for showing me how 
Wiles’ paper extends with essentially no change to that case, by using (ii), 
(iii), (iv) of LS 6, and the minus sign in DL 6. 

We let: 


A = Lubin-Tate group associated with the prime element z. 


We let: 
(Xo, X%1,..-)€7,(A) with xo 4 0. 
K, = K(x,) = K(A,"+1). 
N, = norm from K, to K, and N,,,, = norm from K,, to K, for m > n. 


T, = trace from K, to K. 


We abbreviate < , >, to < , > unless we wish to specify the level at which 
the symbol is taken. 
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subgroup of K* consisting of those elements whose norm to K 
lies in x” (i.e., is a power of 2). 


(\ NnnK* by Lemma 1 of Chapter 8, §5. 


mon 


Nz (a7) 


T(K%) = group of vectors (%, %,...) with «, ¢ K% such that 
Ninjn&m = On 


§1. Statement of the Reciprocity Laws 


Theorem 1.1. Suppose « € 0, and a = 1 mod p,. Then 
N,a = 1 mod n™*+ 
and 
1 = 
(Xns On = qe (Nm vie 1) Xn: 
Proof. By the formalism of the norm residue symbol, we know that 
1 = (a, K,/Kn) = (Nn%, Kn/K). 


Hence [N,,«]x, = x, by the Lubin-Tate theory, so the first assertion is clear. 
We choose t = Xen+1 SO that [x"*+]¢ = x,. Then 


Xn a> = yt —at. 
Since (a, K,(t)/K,) = (Naa, K,(t)/K) we obtain from Lubin-Tate theory 


{Xn > = [Na *]t -at 
= [N,a7? — 1]¢. 


Using the first congruence and the fact that [x"**]t = x, yields the theorem. 


Corollary 1. Let « = 1 mod ,. Assume that K is unramified over Q,. Then 
1 
(Xn O) = |-an T,(log | Xn 


where the log is the ordinary log on the multiplicative group. 


Proof. Since x is unramified, we can write 
N,wt=14+2 
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where z = 0 mod p"*?. Since p # 2 it follows that 
log N,«~? = z mod z?. 
Hence 
—T,(log «) = log N,a-! = N,a-1! — 1 mod p?*t?2, 
Since K is unramified over Q,, we have x ~ p, and the corollary follows. 


Corollary 2. Let A = G,, be the formal multiplicative group. Let € be a 
primitive p"*+th root of unity, and let « = 1 mod p,,.. Then 


(¢, a) = (~Cips* UT ,(log a) 


Proof. Special case of Corollary 1. 


The law of Corollary 2 is one of Artin—Hasse’s laws, obtained here by 
Wiles as a special case of the Lubin—-Tate formalism. We have written the 
symbol with the usual parentheses, transfering its meaning to the multiplica- 
tive group. 

We shall now state the main result of this chapter. Let ae K*. Let 
r = ordy, a. Let g(X) = c,X" +--+ be a power series in o[[X]] with a unit 
c, such that 

a = g(x,). 
Of course, there exist infinitely many such power series. Let 
D = d/dx 


be the ordinary derivative of formal power series, so that 


Dg(X) = 9'(X). 


D,L(«) = g'/g(xn). 


The operator D,L depends on the choice of element 


Define 


(Xo, x1, bar +) E T,(A), 


and it depends on the choice of g. We shall see later to what extent it does 
not depend on g. 
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We define 


Again this depends on the choice of g. 
Let xe A(p,) and let «, ¢ K*. In Lemma 3.2 we shall give conditions 
under which the symbol 


[x, Onn = = Tm(Aa()5n(Cn)) 


is well defined mod x"*} independently of m. These conditions involve either 
x being sufficiently divisible in A(p,), or m being sufficiently large. The value 
of the symbol lies a priori in K/x"*+o, but it will turn out that under suitable 
conditions, its value lies in o/x"*1p, so that it can be viewed as an operator 
on A,»+1, This was the reason for selecting the bracket in the notation. 
Precisely, the conditions are as follows. 


Condition (i) m > 2n + 1 and there exists an integer 


k > [n/2] + 2 + 1) if p is odd 
k > max{[n/2] + 27+ 1,e+ 1} ifp = 2, 
such that am = Nim, with a, € Kx. 
Condition (ii) m > n and ord, x = [n/2] + 2. 
Theorem 1.2. Let x € A(p,) and ae K*. Suppose « = Nyin&m for some 
Om € K,*. Under either one of the conditions (i), (ii), the symbol [x, onlm 
has value in o/n"**o and we have equality 
<x, OA = [x, tm}a(%n). 
An important case is that when 
a = (%, &,...)€ T(K%). 


Thus a, satisfies an infinitely regressive norm-divisibility condition. In that 
case we may define the symbol 


[x, a] = [x, Om dm 
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for arbitrarily large m, and its value will be the same for any m > 2n + 1. 
It gives the formula 


{X, On>n = [x, we] a(n). 


Example. The formal multiplicative group. For B = 1 mod », and «e K* 
we defined the classical norm residue symbol 


(B, O)n =, <B a 1, Oe + 1, 


where A = G,,. Consider the special case when K = Q, and xz = p. Let € 
be a primitive p"*!th root of unity. Then 


xy =O-1. 


We have 1,(X) = log(1 + X), so 


: 1 Ss 
Aa(%n) = ta” ce 


Let xe p, = G,,(p,). Then we find for m = 2n + 1: 


(a, = wherey = ar T, (Clog (1 + x) Dy, log «). 


This is Iwasawa’s formula [Iw 10]. 


Finally there is another Artin—Hasse reciprocity law generalized by Coates— 
Wiles to the Lubin-Tate case for level 0. 


Theorem 1.3. Let x € A(p@) and a € K¥. Then the symbol [x, a]) has values 
in o/n0, and we have 


<x, Do = [x, a]6(xo). 


The rest of the sections will be devoted to the proofs. By LS 2, we may assume 
without loss of generality that A is basic, so that we can apply LS 6. 


§2. The Logarithmic Derivative 


In this section we investigate systematically the logarithmic derivative, when 
it is well defined (modulo certain powers of the prime), and also to what 
extent the mapping 6,(«) is well defined. We let: 


®, = different of K, over K; 
Din = different of K, over K, for m > n. 


Then 
Do = TP. *, Dasiyn = NOn+1> Dy = A"DoVy. 
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These are immediate by considering the basic Lubin—Tate generators w,, 
satisfying the equations: 


wa-t+n=0 and wi,, + tWaa1 — W, = 0. 


The relative different is obtained by taking the derivative, and evaluating at 
Wo and w,41 respectively. The given values fall out. 
Let a € K, and write « as a power series 


a = g(x,) with g(X) = c,X" + higher terms 


and c, equal to a unit. Let g(X) = X’A(X). Then 
g'le(X) = % + WAX). 


Hence g’/g(x,) is integral if r = 0, and in any case lies in pz}. If 
&(X) = XA,(X) and g(X) = Xth,(X) 


are two power series whose values at x, are equal to a, and f(X) is the 
irreducible polynomial of x, over K, then 


gi(X) — g(X) = XF(X)Q(X) 


for some power series p(X) € o[[X]]. Hence 
, , Xn pr 
81/81(%n) — 82/82(%n) = $F f’On)POn) = 0 mod D,,. 


This shows that g’/g(x,) is well defined modulo the different. 
DL 1. The map D,L is a homomorphism 
D,L: K¥ + pz! mod ®,. 
and the image of the units lies in 0, mod D,,. 
This is obvious from the previous discussion. Since A’(X) is a power series 
starting with 1 and with integral coefficients, it follows that 4’(x,) is a unit. 
Hence from the definition of 6,, we find: 
DL 2. The map 6, is a homomorphism 
6,: K* — K, mod Doo,. 


Its image lies in n-"p,* mod Do0,, and the image of the units lies in 
m~"™ mod Dob. 
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As the elements of K* are generated by powers of x, and units, the com- 
putation of 6, is reduced to 6,(x,) and 6,(units). Note: 


DL 3. For o € Gal(K*/K), and «e K*, 


5,(a°) = x(0)5,(a)’. 
Proof. Write « = g(x,) as usual. Then 


oa = g(ox,) = g([x(o)]x,). 
Thus we let g,(X) = g([x(c)](X)), so that 
ou = £,(X;). 
On one hand, we have 


(1) n" Sq(0)” = 57 ax 58 'Ig(6Xq) = Tes (eee gob), 


On the other hand, since J © [x(o)](X) = x(c)AC(X), we find 


(2) A (GXn)[H()]'(%n) = #(0)A'(Xn)- 
Furthermore 

(3) n" 6,(oa) = Fig) 23 fe (). 
and 


8a (X) = g'([x(o)(X)[x(o)]'(X). 
Putting (1), (2), (3), (4) together yields the desired property. 
DL 4. Let m = n and let « be a unit in K*. Then 


Om(%) = 6,(«) mod Dodp. 
Proof. The proof is similar to DL 3. We know that 
Ao [x™-"(X) = n™-"A(X), 
so by the chain rule, 
M(Xn)Ma-"Y'%m) = A "A'(Xm)- 
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We have the representations 


a = g(x,) = g([n"~"]Xn) = &m(Xm) 
where 
&n(X) = go [n™-"](X). 


Since « is assumed to be a unit, the power series g starts with a unit, and so 
does the power series g,,, So both these power series can be used to compute 
the logarithmic derivative. The rest of the proof then follows immediately 
from the chain rule and the definitions. 


DL 5. Let m = n, and a, € K*. Then 


On(Nin,n&m) = TnnOm(%m) mod On. 


Proof. Without loss of generality we may assume that m =n + 1. We 
first deal with the case when a@,, is a unit. We find: 


bn(Nin,n%m) = Om (1 an) = a Om(On) by DL4 


oe 


= > (0)5m(%m)”. by DL 3. 


The sums are taken over a € Gal(K,,/K,). For such o we must have 
x(o) = 1 mod x™*? 


because [x(c)] is the identity on A ,»+1. Since 6,,(a,,)° liesinz~="o,, = m~%*p,,, 
the desired congruence follows. 


It will then suffice to prove DL 5 next for «, = X,, because of the multi- 
plicativity of the function 6. For simplicity, let us first suppose that the 
Frobenius power series associated with the Lubin-Tate group is in fact a 
polynomial, 


[n(X) = f(X) = 2X 4---+ X4, 


and that the coefficient of X% is exactly 1. For instance, the basic Lubin-Tate 
group and the formal multiplicative group are of this type. Under this 
additional assumption, we have in fact the stronger property with equality 
instead of the congruence: 


1 1 


DL 6. bn(Nnn(— Xm)) = Tn,nOm(— Xm)s where Om(Xm) = 7" VGn)%m- 


Remark. The minus signs are there to take care of the case p = 2. If p # 2, 
they can be omitted. 
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Proof. We may again suppose m = n + 1. We have (as in the proof of 
Theorem 2.2 of Chapter 8) 


Nn +1,n(— %n41) = —Xn 


The formula to be proved amounts to 


1 1 
A (Xn)Xn 7 Pavia ae Ten)" 
We have x, = f(xn+1), and since A(f(X)) = A(X), we have 
(Ac f)(X) = MFXOS (XY) = 24(X). 


We put X = x,.,,, and see that the formula amounts to 


x, 
T,, n Tn on = 
oe (; acs} 


We replace Xn by I (Xn+1) = 1Xasi bot Xie. Let « = Xn+1- Standard 
orthogonality relations of elementary algebra (see for instance Algebra, 
Chapter VII, §6) yield 


gi-1 oe 
Travia $75) =1 ifi= q 
0 ifixg. 


This proves what we wanted. 


The proof of DL 5 in general when «, = x, follows exactly the same 
pattern, but we end up only with the asserted congruence. We give the details. 


By the Weierstrass theorem, we may factor in 9,, 


S(X) — Xn = B(X)A(X) 
where 
B(X) = bp +--- 4+ by-1 X97? + X4, b, = O mod p, 


A(X) = co + ¢,X +--+ is a unit power series, Co € oF. 


Then f'(Xn41) = 2’(%n+1)4(%n+1)- Proceeding as before, we are reduced to 
proving the congruence 


x. 
T,, n = oom = 1 mod ae 
ha (; es) ® 
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Again we replace x, by /(x,+1). From the factorization we have 
Co = 1 (mod x,). 
Hence 
A(Xn41)71 = 1 mod Xq41. 


From the orthogonality relation, we obtain a contribution of 1 from the trace 
of one term. From the definition of the different (which is precisely g’(x,.1)) 
it is then clear that the traces of all the other terms are =0 mod },, as desired. 


Property DL 5 can be expressed in the projective limit as usual. Let 


T(Ka/0o) = projective limit of the additive groups K,/o, under the 
trace maps, 


= group of vectors (Zo, Z,,...) with z, € K,/o, such that 
Th+inZn+1 = 2n- 
Then the map 
6: T(KS) > T(Ka/00) 
given by 
(. +5 Ons. JE>(..., On(@n), - -) 


is well defined, and is a homomorphism. 


§3. A Local Pairing with the Logarithmic Derivative 


Having derived the necessary formalism for the values of 5,(«), we may now 
combine this with the logarithm on A to define the symbol 


[x, alm = + Ta(A()5n(@) for a K#. 


Lemma 3.1. The symbol [x, «],, is well defined mod x"*? in each of the follow- 
ing cases: 


(i) x € A(p,) and m = 2n + 1; 
(ii) x € A(p2%") and m > n. 
Proof. By DL 2 we know that 6,,(a) is well defined mod Do0,,. Hence the 
symbol is defined mod x"*? if 
1 
Ta(2 1(3).0m) Cc n™t1p, 
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By the definition of the different D7 1 = 1-"Do 10, this is equivalent with: 
n™-"-2)(x)DZz is integral. 


It will even be shown that in case (i), ™~"~?A(x)Dp is integral. 
For future reference, we prove congruences which imply the above, and 
list them systematically. 


C1. If xe A(p,) and m > 2n + 1 then 
1 
Ta( 5 4(3)0n) c nttly, 
C 2. If x € A(p2”) and m = n then 
Ta(2 4(3)®e0q) C ato, 
C 3. If x € A(p2"D_) and m = n then 
1 
T (4 4(3)n) Cc ty, 


Observe that the D, does not occur inside the trace in C 1 and C 3. Only C1 
and C 2 are needed for Lemma 3.1 but C 3 will be needed for Lemma 3.2. 
We now give the proofs. 

Suppose first that m = 2n + 1 (the worst case of (i)). We have to verify 
that 


n"-12(x)Dp is integral. 


Recall that ord, Dy = (¢ — 2)/(¢ — 1). 
By Chapter 8, §6, Lemma 3 we know that A(x) is a power series in x whose 
terms are either integral, or at worst with a factor 


aaa and i> 1. 


Suppose x € A(p,). Then 
ord 2*-1A(x)D, >n—1t+ fees ey ee ao 
gq —- 1) q-1 


We need the right-hand side >0. For i < n — 1 this is obvious, because 
n—1-—i2 0 and the other terms are positive. For i > n the estimate is 
equally easy. 

Suppose next that x € A(p2%") and only that m > n, say m = n which is 
the worst case. Then x lies in the disc of “good” convergence for the log and 
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exponent, and thus 


ord A(x) = ord x = a 7 


For C 2, it suffices to verify that 2~?A(x)®@ is integral, or equivalently 


2 q¥-~2.5 
q- < ¥; 


Rey ap aa i 


which is obviously the case. The proof for C 3 is the same. 


We remark that in the range where the symbol is well defined, it is 9,-linear 
in x and multiplicative in «. In any case, within the ranges of Lemma 1, we 
view the symbol as having values in 


K mod 2"*?1ox. 


The next lemma will show that the value [x, «,], is independent of m when 
Om, is the component of an infinite vector 


a = (%, %,...)€ T(KS), 
and m is sufficiently large. We define [x, «] to be this value. 
Lemma 3.2. Let k => m => n. Let a, € Kx and a, € K¥ be such that 
Om = Ne m%e- 


Then 


[x, a Je = [X, nm mod nt 
in either case: 
(i) x € A(p,) andm > 2n + 1; 
(ii) x € A(pi"Do) and m = n. 
Proof. We have: 


[x, Om bm oe [x, Num] 


in(A(X)O m(Nic,m%e)) mod 2” *+ 


: m(A(X) Tic, mOx()) by DL 5 and C 1 or C3 


—_ 


= Tif A(X) 6 (o;.)) 
= [x, Odie, 
as was to be shown. 
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§4, The Main Lemma for Highly Divisible x and « = x, 


Only the statement of the main lemma will be used later, and we recommend 
to the reader to read the next sections before reading the proof of the main 
lemma. 
Lemma 4.1. Let x € A(p,). Suppose that 
ord, x = [n/2] + 1 + 2e, 
where e is the ramification index of K over Q,. Then 


(X, Xan = [x, Xn}a(xn)- 

Proof. First we remark that for the applications, the exact nature of the 
order condition on x is irrelevant, and the reader will find it easier just to 
assume that the lemma is being proved under the assumption 

ord, x > [n/2] + 50,000e, 
or any other high multiple of e, which would do just as well. Also, instead 
of [n/2], any expression like [n'~*] would do just as well. In the next section 
it will be shown how to use a duality to lower back such expressions to the 
precise ones which we ultimately want. 

We let ord, x > t(n), and derive sufficient conditions (also more or less 
necessary) for the method of proof to yield the lemma. 


During the course of the proof we shall constantly be interchanging 
logarithms with the first term in the expansion. If ord, x > 1 then 


2 
A(x) = x mod = 


If p is odd, we even have 
A(x) = x mod x?. 


Furthermore, if ord, y > e then 
y? 
log(1 + y) = y mod om 
The formula to be proved is 


cae = [as T(2Q) gee) | 
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We first want to replace A(x) by x on the right-hand side. It suffices for this 
that 


T, ae pas = 0 mod z"*1 
Nth A (x,)Xn) : 
Since A’(x,) is a unit, this is equivalent with 


aly eae Sel 
(1) RO a8 3 a a oe ° 


Certainly t(n) > (7/2) + 2 suffices. Again since A’(x,) is a unit, proving the 
formula for all x is equivalent to proving 


Grads) = | To(Z)] rd. 

This will be done by the sequence of following steps. 

Step 1. <A’(x,)x, X.> = <x + Xp, Xn> 

Step 2. <x, + x, x,> = [-—1]}«x,, x, + x> 

Step 3. <x, X, + = Ca 1+ =». We then apply the basic re- 
ciprocity law of Theorem 1.1 to show that this is equal to 

1 
[ae ToGo) | Gs) 

to conclude the proof. 


Step 1. We shall use the expression involving the mapping yw, in Chapter 8, 
§7, formula LS 7, namely 


{X, Xn) = [TAY n(%n) lan), 
= [7, a XWn( Xn) a(%n)- 


Indeed, A(x) = x mod n™™ and 7,(n?*wy,(x,)) = 0 mod x"*1, because the 
image of w, is contained in AA(p,)+, and 


AA(Pn) > 10, 80 AA(P,)*  (n0,)4 = 2-IDZ?. 


Therefore we need 


T, x = 0 mod z"t? 
"\2D, ; 
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For this it suffices that 
(2) 2t(n) =n + 2. 


Again t(#) = (n/2) + 2 suffices. 
Since A’(x,) is a unit, we may replace x by 4’(x,)x to obtain 


(A (Xn) X; x) = [TA (Xn) XW (Xn) La(Xn)- 


By Taylor’s formula, using the fact that 4’(X) has integral coefficients, and 
MXn) = 0 we get 


2un) 


2 


Mx + X_) = A'(x,)x mod 
provided that also t(7) = e. So let us make 
un) > 5 +1 + 26, 


As we have already seen that 
T (407 (Xn)) = Omod n**?, 
we may replace A’(x,)x by A(x + x,) to conclude the proof of Step 1. 
Step 2. Formally, this is just the alternating property LS 6 of the symbol 


<a, a> = 0, but the proof has to be adjusted because the groups involved on 
the right and left do not play a symmetric role. We have: 


Om dint ay t= Care to + 2 TH Ga + 


= C6 + x)[-]x, 1+ = 
ke ie = [+] <x_ + x, Xq) 
= Cos + x)[—]x_, 1+ =» 
[+] (Xn, Xn + xD [4+] <x, + X, xn>. 


There remains to prove that the first term on the right is 0, and for this it 
suffices to show that 


(xt, + x) [—]x,€[r']A(p,) and 1+ = © Kx? 
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with positive integers r, d such that 
(3) r+ed>n+l. 


Let F(X, Y) be the group law on A. Since F(O, Y) = Y and F(X, 0) = X 
we see that 


F(X, Y) = X¥ + Ymod XY. 
It follows at once that 
(x + x,) [—] x, = 0 mod x = 0 mod x™, 
But then we may take 
(4) r =n) —1. 


To solve 1 + y = u?* with some unit u, for y = x/x,, we use the fact that 
the ordinary log and exp preserve the order on the disc of elements z such that 
ord, z > 1/(p — 1). It follows that we can take any integer d satisfying 


1 
O<d< Se ae ee 
or in terms of z, 


(5) Osd< — - ——, -— 
For instance it suffices that 
1 
O<d< xn) - 1. 
Picking t(m) = [n/2] + 1 + 2e suffices. This concludes Step 2. 
Step 3. Let y = x/x,, and a = 1 + y. We have: 
(Xn Xn + X) = Xq, 1 + y» 
1 
= | Onc + 9* = 0] Ge. 

We contend that 

NACL + y)~* = —T,(y) mod 17"*», 
whence it follows that 

1 

<Xns Xn + x> = |-as T.(9)] 0 

thereby concluding the proof of the main lemma. 


235 


9. Explicit Reciprocity Laws 


To prove the contention, write V,(1 + y)~* = 1 + z, with 
z = 0mod z"*}. 
(Cf. Theorem 1.1.) Then 
log N,(1 + y)~* = T,@og(1 + y)~*) 
= —T,(y) mod T,(4¢y’o,). 
This amounts to the same type of congruence as before, and is obviously 


satisfied. This concludes the proof of the main lemma. 


§5. The Main Theorem for the Symbol <x, xn>n 
Theorem 5.1. We have the equality 
«x, Xn = [x, Xmln(Xn) 


under either of the following conditions: 

(i) x € A(p,) andm > 2n4+ 1 

(ii) x € A(p2""D_) and m = n. 

Proof. By LS 5 of §5 in the preceding chapter, we have for m = n 


«x, Xa = <[x™-"]x, Xm>m* 


This shifts the burden of the proof to level m, and [x"~"]x is divisible 
approximately of order m (asymptotically for m —> oo). Specifically, to apply 
the main lemma of the preceding section, we want to take m so large that 
ord, [x"~"]x = [m/2] + 1 + 2e. 
Since 
[n"]A(p,) © pe” and [x™-"]xen™- "ps", 
it suffices to take m > 4(n + 1) + 4e. 


Let x’ = [x™-"]x. We apply the main lemma of the preceding section to 
x’ instead of x and m instead of n, to conclude that 


CX, tare = [[2"-"]x, tm] aOm) 
= |wm-* 2 T(A3u(m) | 
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We write this as 
[r"-"a],(xq), where a = + Ty(Ax)5q(%m)). 


This means in particular that x™~"a is integral. On the other hand, there 
exists b € 0 such that 


CX, Onn = [B]aQ%n) = [b]a 2 [2™- "am = [2"-"B] An). 
Hence we have the congruence 
n™-"@a = n™-"b mod 2x™*}, 
whence it follows that a is integral and also that 
a= bmod x"*}, 


which concludes the proof of the equality between the symbols under either 
condition (i) or (ii) according to Lemma 3.2. 


The next theorem shows how one can refine the conditions of Theorem 4.1, 
with a more precise definition of the symbol 6,(x,) for the special case a = xj. 


Theorem 5.2. Assume that the Frobenius power series associated with the 
Lubin-Tate group A has the form 


A(X) = XH + WX, 


i.e., is a polynomial of degree q with leading coefficient 1. Define more 
precisely 


[x, —x,] = tt T, (40) Team)" 


Then for x € A(p,) we have 


«x, —Xa>n = [x, — Xn] a(%,)- 
Proof. First observe that the elements — x,, form a vector 
(— xo; —Xy,-- -) Ee T(K2), 


i.e., each is the norm of the successive one. Instead of using Lemma 3.2, 
however, which relied on DL 5, we may now use directly the more precise 
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relation DL 6 which gives an equality implying the stronger statement: 


[x, XmJm = Tn(A(X) 5 ml a Xm)) 


1 
: 
2 T(AQ)T nsdn —¥n)) 
= 2 T,(A@)5,(—¥); 

= [x, —x,]. 


The value 6,,(—x,) is here taken to be specifically (1/m")(1/A'(Xn)Xm), rather 
than up to a congruence. 


Example 1. Take A = G,, to be the formal multiplicative group. Then it 
satisfies the hypothesis of Theorem 5.2, and we obtain another reciprocity 
law of Artin—Hasse: 


(x, x) = (8H 


where 


bs -) = STs (:. log(l + ») 
Xn = C ae 1, 


and ¢ is a primitive p**1th root of unity. 


Example 2. Take again A = G,,, let p # 2 and let B be the special Lubin- 
Tate group, corresponding to the Frobenius polynomial 


X?-14 p=0. 
We contend that: 


<x, Xo> = 0 if i is an integer prime to p, or i > p. 
«xb, Xo = Xo- 


Proof. For the first statement, we have by multiplicativity: 
0 = <xb, x6> = [7]Kxo, xo). 


If i is prime to p, this proves our assertion because A, is a p-group. If i > p, 
then one sees from the formula with the trace that the symbol gives 0. 

The more interesting case is <x3, X9>. We could work directly on the 
multiplicative group as was done classically, but the functorial formula LS 2 
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of §5 in Chapter 8 shows that it suffices to prove the result on the special 
group, where 


-1 
x§7-* = —p. 


By Lemma 2 of §6 in Chapter 7 (the logarithm for the special group) we know 
that 


A,(x) = x + terms of degree > p. 


Hence 


eh a re Pp 
[x6, Xo] = 3 T( + O(x8-3) Xo log(1 + x8) (mod p) 
1 


= 5 Ta( = #8) (aod p) 
To(— 1) (mod p) 
1 (mod p). 


This proves the assertion. 


§6. The Main Theorem for Divisible x and « = unit 
Theorem 6.1. Let x € A(p,) and suppose 
ord, x = [n/2] + 2, if p is odd 
ord, x = max{[n/2] + 2,e + 1} ifp =2. 


Then for any unit « we have 


<x, On = [x, a]a(%n)- 


Proof. The units are generated by y,-, and the units =1 mod »,. If 
& € Hg, then both sides of the equation are trivially equal to 0. So we deal 
with the units =1 mod p,. A (topological) set of generators for these units 
consists of the elements 


1—exi, withj =1,2,..., 


where eis a(q — 1)th root of unity. The elements x{, form additive generators 
for A(p,) over 0, and in our case, we need consider just those powers with i 
satisfying 


ord, x4, > [n/2] + 2. 
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It suffices to prove the lemma for the symbol <x4, 1 — ex}>, with such values 
of i. We shall reduce the proof to the case <x, x,. 
We start with the symbol [xi,, 1 — ex4]. Since 


—Jexn* 


_ ane ee 
6,1 — exj) = Vx — x4) (Xn) Xn p> 


eXns 


we find 


i = ext] =J S t 1 rytt}. 
[xh 1 ex4] ant 1 2 T,, (264 M (Xn) Xn é x) 


ot J . 4 1 Ty7F 
arti Z Tee A (Xq)Xn EXn 


oO 
=-j > [exit x). 
r=1 


The above formal steps are obviously justified. First the sums taken mod 2"*} 
are actually finite, and second we have replaced A(y) by y and vice versa 
twice in the range where this applies. The equality takes place in K/x"*1o 
where the symbol [x, a] takes its values. 

By Theorem 5.1 we know that 


[e"xn*, XnJa(%n) = <exnT", Xn>- 


Therefore 
[ei 1 = eet Le) 1-7 Dy hou we 
r=1 


and this latter sum is taken on A. Since <x, —1> = 0 by LS 6 if ord, x is big 
enough, it will therefore suffice to prove the next and final result. 


Theorem 6.2. Suppose 
ord, xi = [n/2] + 2 if p is odd 
ord, x, = max{[n/2] + 2,e +1} ifp =2. 
Let j = 1. Then 
<xh, exh — In = [-J] D, Sex, xno. 
r=1 
Proof. Let F be the group law on A. Since 
F(X, Y) = ¥ + Ymod XY, 
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we obtain for x, y€ ,, 
x[+]y=x+ymodxy and x[—]y =x — ymod xy. 
This will be applied when ord, x and ord, y = [n/2] + 2, so that 
ord, xy >n+ 3. 


In that case, A(n"*30,) < [x"**]A(p,), so addition on A and addition on G, 
are interchangeable on the left of the symbol 


KX, On 


under the condition ord, x > [n/2] + 2. 
This being said, we find: 


0 = <xi(exi, — 1), xi(exi — D> 
a= Cert tt xh xh [+] <ext? = ant 1D 


= <exh ts, xt> [4+] <exht4, exh — 1> [—] <n, ex — DD 
whence 
xi, exh — I> = <exht?, exh — 1D [+] <exnt!, xp. 


Note that <x, «> = 0 for all ¢ € wy-3. 
Recursively we obtain 


<erxtt 5, x (r- D1» 


| 
Ms 


«xn, EX, — 1D = 
oo 
= > cert, a5 
[> 0) 
= [-f] >, <exir, xn). 
r=1 


Observe that the sums are in fact finite since for large r the left-hand side of 
each symbol in the sum is highly divisible, so the symbol is 0. Hence the above 
formal steps are valid, and the theorem is proved. 


The special case when n = 0 is often interesting for its own sake. The 
next lemma may be omitted in the proof of Theorem 7.1 or Theorem 1.2, 
but is useful for the proof of Theorem 7.2. 
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Lemma 6.3. Assume that B is the special Lubin-Tate group with Frobenius 
power series X* + nX. Let wo € B,. Then for i = 2 andj = 1 we have the 
same formula with n = 0 as in the previous lemma, namely 


<wi, we — 10 = [-J] >, Swit", Woo. 
r=1 


Proof. By Theorem 2.3(ii) of Chapter 8, we know that the group law on 
B satisfies 


F(X, Y) = X + Y + terms of degree > g. 
If x, y € Po? it follows that 
x+y=x[+]y mod pet. 


But 2? < [x]B(p,). Hence again addition on A and addition on G, are 
interchangeable on the left of the symbol <x, a>» under the stated conditions, 
and the rest of the proof is then identical with that of Theorem 6.2. 


§7. End of the Proof of the Main Theorems 
Theorem 7.1. Let x € A(p,) and let 


& = (Ons Onsiy---> Ox) With Om = NemO&n- 
Assume that 
k = [n/2] + 2” + 1) if p is odd 
k > max{[n/2] + 27+ ),e+B fp =2. 
Then [x, &m] lies in o[n™*10 for 2n + 1 < m < k, and we have 
CX, On >n = [5 Cm }n(%n) 


for such m. 


Proof. The theorem has already been proved when « is a power of x, or 
when « € H,-,. We may therefore assume that « is a unit =1 mod p,. We 
reduce the theorem to the result of the preceding section in exactly the same 
manner that Theorem 5.1 was reduced to the main Lemma 4.1. This time 


we need 


ord, [2*-"]x > [n/2] + 2 if p is odd 
ord, [x*-"]x = max{[n/2] + 2,e + 1} ifp = 2. 
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Clearly then, the lower bound on k given above is sufficient. The proof is 
then identical with that already given for Theorem 5.1, as desired. 


For the case n = 0, special features arise, and we next give a generalization 
to Lubin-Tate groups of another explicit formula of Artin—Hasse. 


Theorem 7.2. Assume p # 2. Let x € A(p3) and « € KX. Then 
1 
[x, @]o = = To(A(x)50(a)) 
is well defined mod 1, lies in o/no, and we have 


«x, &>o os [x, a}§(Xo). 


Proof. Since 4(x,) = 0 it follows that A(po) = A(p?) = 2. This shows 
that [x, a]o is well defined mod z because 5p(«) is well defined in p>! modD,. 

By formula LS 2 of Chapter 8, §5 it suffices to prove the theorem when 
A = B is the special Lubin-Tate group. In that case we already know the 
result when « = X) or when a € p,_,. We may therefore assume that « is a 
unit =1 mod pp. In that case, we follow the same arguments as in the proof 
of Theorem 6.1 using Lemma 6.3 instead of Theorem 6.2. The reader can 
check that each step is valid, to conclude the proof. 
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This chapter gives a number of complements to Chapter 4. In §1 we extend the 
formalism of the associated power series to the change of variables 


xoy 


for x € Z, and y equal to a topological generator of 1+pZ,. A measure on 
1+pZ, then corresponds to a measure on Z,, and we give relations between 
their associated power series. This is then applied to express Bernoulli 
numbers B,, , as values of power series. We write 


x= day a AW, 


where first 6 is an even character on Z(dp)* (d prime to p), w is the Teichmuller 
character, and yw is a character on 1 + pZ,. Let € = w(y). Then 


1 
k By, y ee Sou - 1), 


where f,, depends only on @ and k. This allows a partial asymptotic deter- 
mination of ord, B,, , when 0 is fixed, and the conductor of y tends to infinity, 
due to Iwasawa [Iw 14], §7. This gives rise to the corresponding asymptotic 
estimate for the minus part of class numbers of cyclotomic extensions. 

The Iwasawa expressions for the Bernoulli numbers gives an asymptotic 
value for their orders: 


ord, By oy = mp" + An+c 
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for n sufficiently large, cond w = p"*'. In order that m # 0, Iwasawa showed 
that a system of congruences had to be satisfied (essentially that the coeffi- 
cients of the appropriate power series are =0(mod p)). We derive these 
congruences here in each case successively. The next chapter is devoted to 
the proofs by Ferrero-Washington that these congruences cannot all be 
satisfied, whence the Iwasawa invariant m is equal to 0. 

At the end of their paper, Ferrero-Washington conjecture that the in- 
variant A, for the cyclatomic Z,-extension of Q(p,,) satisfies a bound 


log p 
A, <——.. 
p< log log p 


I am much indebted to Washington for communicating to me the exposition 
of the steps which lead to this conjecture, and which were omitted from their 
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We let p be an odd prime for simplicity. The multiplicative group 1 + pZ, 
is then topologically cyclic, and we let y denote a fixed topological generator. 
Then y mod p” generates the finite cyclic group 1+pZ, mod p” for each 
positive integer n. For instance, we may take 


y= 1t+p. 


[Note: If p = 2, then one has to consider 1+4Z, instead of 1+2Z,.] 
There is an isomorphism 


Z, 7 1+pZ, 
given by 


xh y. 


Its inverse is denoted by a, so that by definition 


a(y*) = x. 


Let d > 1 bea positive integer prime to p. We shall consider measures on 
the projective system of groups 


Z, = Z(dp") = Z/dp"Z = Z(d) x Zp"). 
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The projective limit is simply denoted by 
Z = Z(d) x Z,. 


A measure is then determined by a family of functions y, on Z,, a8 in 
Chapter 2, §2. We let 


Z* = Z(d) x Z} and Z** = Z(d)* x Zt. 
An element z € Z* can be written uniquely in the form 
z = (Zo, ny") = (Zo; Zp) with zy) € Z(d),n €p,-1,x € Zp. 
We define the homomorphism 


a: Z* +Z, by azo, ny*) =x. 


We define as usual 


{Zp = <2) = (2 =, 


so that a(z) = a(<z>). As above, we usually omit the index p on <z),. 

A continuous function on Z, gives rise to a continuous function on 1 + pZ, 
by composition with a, and conversely. 

As in Chapter 2, §1 we let v be the ring of p-integers in C,, and we let 
be an v-valued distribution, i.e. a measure. 

By the basic correspondence between functionals and measures, we obtain 
the following theorem. 


Theorem 1.1. Let be a measure on Z with support in Z*. Then there exists 


a unique measure a, on Z, such that for any continuous function @ on 
1+ pZ, we have 


[ oC<ar) data = [960 ala ud0. 


We now describe the power series associated with «, modulo the poly- 
nomial 


h,(X) = (1 + x)?" — 1. 
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Thus we fix a value of n > 0, and for each a € Z* we let r(a) be the unique 
integer such that 


0 <r(a) <p" and r(a) = aa) mod p". 
Theorem 1.2. Let f be the power series associated with a, 1. Let 
Zt,, = Z(d) x Z(p"**)* 
Then 


SX) = Ye base (a+ XY mod h,(X). 


aeZhe1 


Proof. By the definition of the associated power series, we have 
pri 
fH= Y @wMOd+Xy. 
r=0 
But letting char denote the characteristic function, we have: 


(a, 4)(r mod p") = | (char of r mod p”) d(a, ») 
Zp 


a nae of Z(d) x py-, x y*?"") du 
(by Theorem 1.1) 
= Y Mn+i(ny’ mod p"**) 
n 
where this last sum is taken over 7 € Z(d) x p,- 1. This proves the theorem. 


Corollary 1. Let be a nontrivial character of 1+pZ,, with conductor 


p"*!. Define Wa) = W(<a)). Let 
Wy) = ¢ = primitive p"-th root of unity. 
Let f be the power series associated with a, pu. Then 
{ wau = 6-0. 
Zp 
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Proof. We have 


I Wdu= [ W(y*) d(a, u(x) (by Theorem 1.1) 
2 Zp 


x © dly u(x) 
Zp 


= f(¢-1). (by Theorem 1.2 of Chapter 4). 
This proves the corollary. 


We continue with the same notation as in the theorem. We shall use the 
notation 


BH) = | Wdu= £6) 


Suppose that there exists a rational number m such that the power series f 
can be written in the form 


(Y= (Co +¢o,X + i re, Ca +¢o,X* + vee) 


where c, is a unit in 0, and co, ..., Cg_, € m, the maximal ideal of v. We call 
m, A the Iwasawa invariants of y, or f. If the measure yu has values in the 
maximal ideal of the integers in a field where the valuation is discrete (which 
is the case in applications), then f has coefficients in that ring, and such m, 
A exist if f # 0. If m = 0, then A is the Weierstrass degree of f- In any case, 
A is the Weierstrass degree of p~"f. 

As usual, we shall write 


x~y 
to mean that x, y have the same order at p. 


Corollary 2. There exists a positive integer ng (depending only on f) such 
that ifn > no and cond w = p", then 


BW, Hw) ~ p™(o —1)* 
where ¢ is a primitive p"-th root of unity. 


Proof. As n — oo, the values |—1| approach 1, and so the term c,(¢—1)* 
dominates in the power series f(¢ — 1) above. 
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Corollary 3. For some constant c = c(f), we have 


ord, [] BW, u) = mp" + An+c(f) 


cond w=pt 
Nost<sn 


Proof. Since 


the formula is immediate, since the product taken for no < t <n differs by 
only a finite number of factors (depending on ng) from the product taken over 
all t, and we can apply Corollary 2 to get the desired order. 


In the light of Corollary 3, we shall call m the exponential invariant, and 
2 the linear invariant. 
Let f be as above, the power series associated with a, 4, and put 


co = Y bas i(ny” mod p"*?). 
n 


n—4 


f(Y= ¥ MA+XY mod h, 


r=0 


Then 


pr. 
bier ue @ mod h,, 


r=0 


where the coefficients a”) are obtained from the change of basis from 
LeXpase, XP ot 
to 


1,14+X,...,04+ xy". 


We can rewrite cy” in terms of the variable u = y’, namely 


cu) = Y Uns (nu mod p"*?). 
n 


These coefficients c(u) will be called the Iwasawa coefficients. 
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Theorem 1.3. Let n be an integer > 0 such that c is a p-unit for some integer 
r with 


O<r<p"-l. 


Then the exponential Iwasawa invariant m of p is equal to 0, and we have 
A<p". 


Proof. Some coefficient a” must also be a p-unit with r in the same range, 
and we can write 
pr—1 


f(x) = L aX" + gy(X)X?" + pgo(X), 


where g,(X), g2(X) € o[[X]]. Hence the coefficient a, of f(X) is itself a 
p-unit, whence the theorem follows. 


We shall sometimes deal with certain measures derived by the following 
operation from py. Let s € Z,. We define the s-th twist of 4 to be the measure 
defined on Z* by 


ua) = <a>*u(a), 


and equal to 0 outside Z*. In that case, the coefficients c\” should be indexed 
by s, ie. 


ch) = 
Since 7’ is a p-adic unit, it follows that the same power of p divides all c{"} 
as divides c‘”’. Thus Theorem 1.3 also applies to the twisted measure and the 


power series f, associated with «,(u'°) instead of f in the theorem, and we 
find: 


Theorem 1.4. Let m,, A, be the Iwasawa invariants of u. If m, = 0 for some 
s, then m, = 0 for all s. Suppose this is the case, and let n be the positive 
integer such that 
p"~ 1 < Ao < p”. 
Then we also have 
p" < A, < p” 
for all s. 
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§2. Application to the Bernoulli Distributions 


Let B, be the k-th Bernoulli polynomial (cf. Chapter 2). We had defined the 
distribution E, at level N by 


1 
Ey) = NET )) 


N = dp", 


We shall now use 


where d is a positive integer prime to the prime number p. 
We continue using the notation of the preceding section. An element 
of Z = Z(d) x Z, is described by its two components 


X = (Xo, Xp). 


Let ce Z(d)* x ZF = lim Z(dp")*. We define 


Bx) = B(x) — chEWC7*x) 


for x € Z(N). The multiplication c~ x is defined in Z(N)*. 


Note. In Chapter 2, we took c to be a rational number. This is not neces- 
sary, and restricts possible applications too much. When c occurs as a coef- 
ficient in Chapter 2, we must use c, instead of c, i.e. we must use its projection 
on Z*. When c occurs inside a diamond bracket, then no change is to be made 
for the present case. For instance, we have 


-1 
El. E(x) = (*) = () & 5 (ep 1). 


Similarly, formula E 2 and Theorem 2.2 of Chapter 2 yield the relation 


E 2. E,, Ax) = ar 1E, A(x) 


symbolically for x € Z. We then obtain the integral representations of the 
Bernoulli numbers as follows. 


1 


1 
— B, = ——— | xt"! dE,,.x), 
k k 1— ck [4 d 1,AX) 
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provided only that ck # 1. Furthermore, if x is a character of conductor 
m = m, dividing dp" for some n, then x defines in the usual way a function on 
Z(N) for m|N by composition 

Z(N) > Z(m) * 0*, 


and x is defined to be 0 on elements of Z(m) not prime to m. Then we define 


1 
k By, y = [x dE. 


Note. This definition made by taking into account the conductor of x 
is more appropriate than that of Chapter 2, §2. There we dealt only with 
characters of ZF, so it made little difference, only for the trivial character. 


More generally, if g is a locally constant function (step function) on Z, 
then we can define 


1 
j Be — [o dE,. 
Zz 


Then 
k-1 1 k 1 
(1) 90x00 xp dE) = 5 Bao — 6 5 Brave 
In particular, if g is a character y, then 
k-1 k 1 
U(x)x, ° dE, (x) = (1 — x(c)c;) k By, y- 
Zz 


We define the p-adic L-function by the integral 


LU — s, x) x(a)<ay3a,' dE,, (a). 


—1 
~1— 70s i 


If the conductor of x is dp" for some n > 0, then the support of the integral is 
really on the set 


Z** = Z(d)* x Z¥. 
Let w = w, be the Teichmuller character, and put 
—k 


Xk = XO 
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Theorem 2.1. For every integer k > 1 and character x of conductor dp" 
with n > 0, we have 


feet 
Ly(1—k, x) = ~~ x4(0)PF") & Ba x: 


Proof. We have: 
~(1-4(Oe)L,(1—k ) = yaa! dE, (a). 
z* 


Write 


Let N = dp"*'!. Then 


py Ens e py 
{ lim > x(p)p "ayy" E..( a ) 
pZ n>o y=0 


npr lim 5 yO) (2) 
k c k l,e N/p 


n>co y=0 


" 1 
= x,(p)p*~ (1 —x(c)c$) j Biome 


The theorem follows at once. 


We now let 


0 = even character on Z(dp)*, 8 # 1, cond 6 = d or dp. 
x% = Ow where w is a character on 1+pZ,. 


Then 


1 
1—x(c)Xcdp Jae 


1 
= HO! i Wap) dita) 


where y is the measure given by 


1 
(1—x.(p)p*~") 5 Bem = W(a)Oe~ "(a)a,~* dE, (a) 


(a) = O(a)" “(aay *E,, (a). 
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Therefore by Corollary 1 of Theorem 1.2 we find that 


1 
To plonceyk ony =<1) 


1 
(1— xa? )PF') | Be, am = 
where fy ,(X) is a power series given mod h, by Theorem 1.2. 
We may use formula E 1 of Chapter 2, §2 to give the value of y at inter- 
mediate levels, namely 


-1 
Mn+ (a) = aor4as'| (2) - o,(4) Se o} 


Starting with the general formula of Theorem 1.2, we shall derive a slightly 
simpler expression for the coefficients of fg, ,, which can be written in the form 


(2) fo, (X) = Yul + XY mod h,, 


where 


-1 
clu) = ¥ 0a-"(n)(n yuk? [tts - c(t) + Hep - »} 


The sums are taken for ue 1 + pZ, mod p"*! and ne Z(d)* x p,-1. The 
component n, is just «(y). The character @w~' is odd, and in particular is 
not trivial. Hence the sum over n times the factor (c,—1)/2 is equal to 0, 
and that term can be omitted. 

We now select c ¢ Z(d)* x p,-,, so that <c>, = 1. Furthermore x(c) = 
@(c). We can select c such that y(c) # 1. We change variables in the sum over 
n, with respect to the second term involving c™ 'n, letting 7 ++ cy. Then we 
may combine the sums over both terms, with a factor 


1 — x(c) 


which cancels 1 — y(c)<c>*é = 1—(c) in front. In other words, we find: 


(3) cu) = 2 Beo™ (nuk ( a :). 


We are interested in applying Theorem 1.3. In other words, we are in- 
terested in proving the Iwasawa conjecture that some coefficient of fy, is a 
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p-unit. Clearly the power u*~ + can be disregarded for this purpose. Thus the 
expressions (3) for the coefficients of the Iwasawa power series give rise to 
the following criterion. 


Theorem 2.2 (Iwasawa congruences). Let d be an integer > | and prime to p. 
Let 0 be an even character #1 of conductor d or dp. If no coefficient of 
fox is a p-unit, then we have the congruences (independent of k): 


Z a 
>» Oo ea) = 0 mod p 
n 
for all « € Z*, and all integers n > 0. 


Proof. We have proved the assertion when a lies in 1+ pZ,. However, 
for any fixed 9 € w,-, we can make the change of variables 


nt>1MNo> 


leading to the congruences as stated above. 


Theorem 2.3 (Ferrero—Washington). For 9 # 1, not all these congruences 
are satisfied, and therefore some coefficient of fo, iS a p-unit. 


The proof that not all these congruences are satisfied will be given in the 
next chapter. Here, we first give formulations for these congruences which are 
more easily dealt with. Then in the next section, we indicate how this result 
applies to the divisibility of class numbers in the cyclotomic Z,-extension. 

The case d = 1. We shall give an alternative version of Iwasawa’s con- 
gruences adapted for the Ferrero-Washington proof. Write any element 
z € ZF as a series 


z=2% +2,p+22p? +°:- 
with integers z; satisfying 0 < z; < p — 1. Let 
SZ) = Zo + 2p ++ + 24" 


be the n-th partial sum. In the above congruences, we may replace na by 
s,(n%), and then omit the brackets giving the representative as a rational 
number. Furthermore, let us write 
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where v is a positive integer, necessarily odd since we assumed that @ is an 
even power of the Teichmuller character. Furthermore, v # —1 mod p — 1 
because 9 # 1. Multiplying the congruence by p”*! yields 


> s,(na)n” = 0 mod p"*? 


NEMp-1 


where v is a positive odd integer, v # —1 mod p — 1. 
Now in the p-adic expansion of z we let 


Zn = t,(z). 
We shall express the above congruence in terms of f,. 


Theorem 2.4. Let 6 #1 be an even character of conductor p. Then 
the Iwasawa congruences imply that there exists an odd integer 


v 2 —I1modp-1 
such that, for all « € Z* and all integers n > 1 we have 


2¥ t,(an)n” = (p — 1) Yn’ mod p, 
ne& nek 


where & is a system of representatives for p,_, mod +1. In particular 
the congruence class on the left-hand side is independent of « and n. 


Proof. We have 


SiON) = Sy 1(ON) — ty4(on)p"*?. 
Furthermore 
Sn+1(%) = an mod p"*? 
and 
oy nv} = 0 
NERp-1 


because v # —1 mod p—1. Hence the congruence of the theorem is equiv- 
alent to 


D tr+ Cann” = 0 mod p. 
n 
Since to(ay) = ay mod p, we always have 
2 to(an)n” = 0 mod p. 
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Finally, since t,,. ,(px) = t,(x), we are led to the congruence 
Y t,(an)n” = 0 mod p 
n 


for all n and all «. But since 0 = p + (p—1)p + (p—1)p? + ..., 
t,(—n) = p—1 —t,(oyn) forn> 1. 
Therefore 


Y t,(an)n” = 2 D falcon)” —(p-1) dn’, 


n ne 


thus proving the theorem. 


The case d > 1. 
Theorem 2.5. Let 9 # 1 be an even character of conductor d or dpwithd > 1 


prime to p. Let 0, = 0m~'. Then the Iwasawa congruences imply that for 
all a € ZF and all n > 0 we have 


-1 


d 
YY 16,(s,(an) + ip"*!) = 0 mod p. 


neR® i=0 


Proof. In Theorem 2.2 we may rewrite the congruence in the form 
1 — 
dp'*! ys aO,(a) =0modp 


where the sum is taken over a prime to dp, such that 
O0<a<dp"*' and <a>, = <>, mod p"*’. 
We can also replace these elements a by elements of the form 
na + ip"*! withi=0,...,d—1, 


and 7 is some (p — 1)th root of unity. The sum is then taken over y and i. 
The sum over i with the factor ya is then equal to 0, and we are left only with 
a sum having ip"*! as a factor. Combining terms with n and —n, and using 
the fact that 0, is odd yields the desired formula. 


257 


10. Measures and Iwasawa Power Series 


§3. Class Numbers as Products of Bernoulli Numbers 


We continue to let p be an odd prime. We write x ~ y to mean that x = yu 
where u is a p-unit. We let: 


6 = even character on Z(dp)*. 
w = character on 1 + pZ, of conductor dividing p"*’. 


The characters on Z(d)* x ZF of the same parity as k of conductor dividing 
dp"*} can be written uniquely in the form 


wow * = wA,. 


For any integer A with 1 < k < p — 1, we define 


1 
hw Zs per? I] TT 5 Be. von: 


deven wp 


In particular, 


h® =pT] 4 


6 even k 


By, o,: 


We can simplify these expressions in so far as p-divisibility is concerned. We 
need a lemma of von Staudt type. 


Lemma 1. Let k be an integer with 1 < k < p—1. Then 


1 


1 
Beo-k = —- : 
j Bue kp mod Z, 


Proof. The proof is entirely similar to that of the Von Staudt congruence, 
Corollary 2 of Theorem 2.3, Chapter 2, combined with the expression for the 
Bernoulli number as an integral in Theorem 2.4 of Chapter 2. We leave it to 
the reader. 


Lemma 2. Let 1 < k < p—1. Then 


1 
hie aad I] {Boom 


OF1 


Proof. The case when @ = 1 combined with Lemma 1 shows that the 
factor p in the definition of h{ cancels the pole of order 1 at p of the single 
term with @ = 1 in the product. What remains is the desired expression. 
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Lemma 3. Let 1 < k < p —1. Then 


1 
he oe hw Il I] k By, WOw-*- 


O41 WHI 


Proof. Write x = 0. Then 


1 
(1—x(p)p*~ *) 5 Disxe-* = Ha)<aypay ' dE, (a). 


1 
1—(c)<c>} i. 


We distinguish three cases, for the terms in the product defining h™. 


If 96 = 1 and w = 1, then we apply Lemma 1. We use one factor of p from 
n+1 


p"** multiplied with 


1 
k By w-* 


to find a p-unit. 


If 6 # 1 and wy = 1, then we use Lemma 2 to get the h® on the right-hand 
side of the formula to be proved. 


The proof of Lemma 3 is concluded by the next lemma. 
Lemma 4. If 0 = 1 and w # 1 then 


1 1 


Bevo ~ Fo 


where y = 1+p and € = wW(y). Furthermore 
. 1 
Dp I] j Be vo-® ~. 
wt 


Proof. We also take c = 1+p. Then 


1—W(cey*~1-¢ and [] U- =p" 


cpr=1 


We note that x(c) = w(c), and we obtain 


1 
as 1-WoKoe 


n 


P 
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Finally we wish to show that 
W(a)<a>pa, ' dE, (a) ~ 1, 
Zt 
i.e. the above integral is a p-unit. Since 
W(a)=1mod1—f and <a> =1modp, 
it suffices to prove that 


i] a,' dE, (a) 
Zt 


Pp 


is a p-unit. This is immediate by writing down the first approximation at level 
p, and concludes the proof. 


For each 6 # 1 we let A(0, k) be the linear Iwasawa invariant of the power 
series fg, in §2, and we let 


Mk) = oe AO, k). 


From the Ferrero—-Washington theorem and Lemma 3, we then obtain: 


Theorem 3.1. There is a constant c, such that for all n sufficiently large, 
we have 


ord, A? = A(k)n + cy. 


This is merely a special case of Corollary 3 of Theorem 1.2, applied to the 
Bernoulli distributions, as discussed in §2. 
We can then apply the theorem to the class number. 


Theorem 3.2. Let h, be the class number of Q(Upn+1). Then there is a constant 
c such that for all n sufficiently large, we have 


ord, h, = A(i)jn + c. 
Proof. The classical class number formula asserts that 


bh ae ap*? I] ri 5Byy, 


x odd 
so that we can apply Theorem 3.1 with k = 1 to conclude the proof. 
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Theorem 3.3. Let K be a cyclotomic extension of the rationals (i.e. a sub- 
field of a cyclotomic field). Let K ,, be the cyclotomic Z,-extension of K, and 
let h, be the class number of K,,. Then there exists a constant c’ such that 
for all n sufficiently large, we have 


ord, h, = A(1)n +c’. 


Proof. It is an easy exercise from the class number formula of Chapter 3 
to show that the minus part of the class number differs from the product 
giving h‘”) only by a finite number of factors. Hence the same estimate holds 
as in Theorem 3.2. 


In Theorem 2.3 of Chapter 12 we shall prove Iwasawa’s inequality bound- 
ing the order of h, in terms of the order of h, . We then obtain: 


Theorem 3.4. Notation being as in Theorem 3.3, there exist constants 
C1, C2 (depending on K) such that for all n sufficiently large, we have 


ord, h, = cin + C2. 


Remark. Iwasawa developed his theory with the point of view that 
Z,-extensions are analogous to constant field extensions for curves over 
finite fields. The formula 


hy =ho TT fE-D 
g 


is analogous for the function field case of the class number formula. The fact 
that ord, h, is linear in n follows at once from the existence of the Jacobian 
in the function field case. Kubert-Lang theory suggests the possibility of 
using the analogous theory in the modular case to analyze the Bernoulli 
numbers B,, and obtain a bound for the invariant 4 in terms of the di- 
mensions of abelian subvarieties of the modular curves. 


Appendix by L. Washington: Probabilities 


We shall give a heuristic argument which estimates the size of A, = A,(Q(u,)). 
The contribution from A; will be ignored, since Vandiver’s conjecture says 
it should be zero. In any case, A> < A; , so we could alternatively double our 
final estimate. 

Let i(p) = index of irregularity = number of Bernoulli numbers 
B,, By, ..., By-3 which are divisible by p. The idea will be to show that 
usually 


A, = i(p), 
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and that one should expect 
Ay Si(p) + 1 


for all but a finite number of p. 

There are (p — 3)/2 relevant power series. We assume that each coefficient 
is random mod p, and that these coefficients behave independently of each 
other. The first coefficients of these power series correspond to the Bernoulli 
numbers in such a way that a first coefficient is divisible by p exactly when 
the corresponding Bernoulli number is divisible by p. The numerical evidence 
bears out the assumption that the Bernoulli numbers are random mod p. 
However, we are also assuming that the higher coefficients are random and 
independent of each other. This is a more dangerous assumption, and I know 
of no supporting numerical evidence. 

Suppose 1, > i(p) + 2. Then we have two cases. 


Case 1. Some power series has its first three coefficients divisible by p. The 
probability that at least one of the first three coefficients for a given power 
series is not divisible by p is 1—1/p°. The probability that for all (p—3)/2 
power series we have one of the first three coefficients not divisible by p is 


1 \@- 30/2 
(1-3) 
Pp 


Therefore the probability that at least one power series has its first three 
coefficients divisible by p is 


1 \(e- 372 ( 1 
1-ji-—= = O{—]. 
( ) p 
The expected number of times this should happen is therefore finite, since 
Y 1/p? < @. 


Case 2. At least two different series have their first two coefficients divisible 
by p. Reasoning as in Case 1, we see that the probability that none of the power 
series has both of the first two coefficients divisible by p is 


1 \(@@-372 
(-3) 
P 


The probability that exactly one has its first two coefficients divisible by p is 


(p—3)/2 \ 1 \(e-3/2)-1/4 
?P")G-g) 
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Therefore, the probability that at least two power series have their first two 
coefficients divisible by p is 


1 \Me- 3/2) —3)/2 1 \e-3/2)- 1/4 1 
[Oey PO” Gl =) 


So again one expects only finitely many occurrences. 
We therefore expect 


i(p) < A, <i(p) + 1 


for all but finitely many p. Therefore estimating A, is equivalent to estimating 
i(p), which we shall do. 

However, first we shall show that usually one should expect 4, = i(p), 
as was the case in Wagstaff’s calculations for p < 125,000. 

If A, = i(p) + 1, then at least one power series has its first two coefficients 
divisible by p. The probability is 


1 ((p— 3)/2) 1 (=) 
eed a ores a Ol), 
( =) 2p p? 


Therefore the number of expected occurrences of A, > i(p)+1 for p< x 
should be 


1 1 
Y 5 ~ 5 108 los x. 


pax 2 


Since 4 log log (125,000) ~ 1.2, it is not very surprising that 4, = i(p) for 
p < 125,000. In fact, one might expect to search rather far before finding a 
counterexample. A reasonable bound might be 107* since 4 log log 107* ~ 2. 
Also note that the fact that 


1 < 4 log log 125,000 
is really caused by the first few primes. If one considers, for example, 


1 


aL? 
30<p<x 2p 
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then the expected number is much less than 1. Starting the sum at p = 31 is 
perhaps justified by the fact that the early Bernoulli numbers, etc., are too 
small to be random mod p. In fact, even though 39 % of primes are irregular, 
37 is the first one. 

We now estimate i(p). The probability that i(p) = i is 


a = ) (: 2 i) Gd o-1/2 @)! 
i p p as p — 00 i! 


The right-hand side is the Poisson distribution. The probability is as stated 
because i of the Bernoulli numbers are divisible by p, each with probability 
1/p. There are (p — 3)/2 — iof them not divisible by p, each with probability 


1 — 1/p. Finally, there are 
i aa 
i 


ways of choosing the i Bernoulli numbers which are divisible by p. 

For i = 0, we obtain the “result” that the fraction of regular primes is 
e 2 ~ 61%. 

The number of occurrences of i(p) = i for p < x should be approximately 


Taking logarithms and ignoring lower order terms, we find, with the help 
of Stirling’s formula: 


log x 


Oe SPEIER 3502 JO IOS Be NOEY «SONG. igoig 


Since x was the first occurrence of i, we obtain approximately 


i(p) < ee therefore A, < log p 


g log p P ~™ log log p’ 
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For p ~ 125,000, this yields 4, < 4.8 which is close to the truth, namely 
A, < 5. Of course, most of the time A, will be much less than this bound: 
61% of the time we should have A, = 0. The “average value” of A, is 


_ dp<x4p _ ,._ (>. i(p) _ O(log log x) 
ia Yeeel = yA x/log x 
see DD) 


yt 


= ) (i) (probability that i(p) = i) 
i=0 
= sy jew V2 (3) 


ie i! 


Bibliography. D. H. Lehmer seems to have been the first to use probability 
arguments such as the above, since he mentioned that 1 — e~ 1/2? = 39% 
of primes are irregular in [Leh]. Later, Siegel published a probability argu- 
ment giving this result in [Si 2]. Numerical evidence appears in Johnson [Jo] 
and Wagstaff [Wag]. Kummer (last page of vol. I of his collected works) 
claimed that a simple probability argument yields the ratio of irregular to 
regular primes is 4, but it appears he was mistaken. 


§4. Divisibility by / Prime to p: Washington’s Theorem 


Theorem 4.1. Let |, p be distinct primes. If p # 2, let y = p and if p = 2, 
let q = 4. Let x be an odd Dirichlet character of conductor dividing dq 
with (d, p) = 1. If W is a character on 1+qZ, with conductor p"*! suffi- 
ciently large (depending on | and x), then the Bernoulli number 5By wy is 
an I-unit. 


Before proving the theorem, we give its application to the class numbers 
of cyclotomic fields. 


Theorem 4.2. Let K be a cyclotomic extension of Q. Let K,, be the cyclo- 
tomic Z,-extension of K. Let | be a prime # p. Then 


ord,|C(K,,)| 
is bounded. 


Proof. By lemma 2, §1 of Chapter 13 it suffices to prove the theorem when 
K = Q(uaq) for some positive integer d not divisible by p. Furthermore, we 
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may also adjoin an /-th root of unity to the ground field, and thus assume 
without loss of generality that d is divisible by |. Theorem 3.2 of Chapter 13 
then shows that it suffices to prove that 


ord, h, 
is bounded. But we have the formula 


h, = Q,w, [| I = 2Bi yw, 
x 


where y ranges over all odd characters of Z(dq)* and w ranges over all char- 
acters of 1+qZ, of conductor dividing p"*'. The factor Q, is Hasse’s index, 
equal to | or 2, and w, is |-bounded. Hence we may apply Theorem 4.1 to 
conclude the proof. 


In the rest of this section, we reduce Theorem 4.1 to congruences similar 
to those which we have already met. To avoid using notation involving 4 in 
case p = 2, we assume that p is odd. Actually, the case p = 2 is easier and was 
solved by Washington before the general case. 

Let W = w, have conductor p"*!. Let 


By me QM W) = QU Hp) 


be the field obtained by adjoining the values of x and y, to Q. Then B, w 
belongs to F,,. Let T,,,, be the trace from F, to F,,. Let mg > 1 be a positive 
integer such that if 


n>m2>MmM 


then F, # F,,. Finally, let @ be a set of representatives of the group of 
(p—1)-th roots of unity in Z,, modulo +1. 

Given the prime | # p, there exist only a finite number of prime ideals in 
Q(n’) lying above /, as is seen immediately from the structure of the de- 
composition group. It follows at once that if 


Fe = ) Fis 
then there is only a finite number of prime ideals in F ,, lying above |. Let 2 
be one of these primes. We shall now indicate how to prove that given x, 
for all y with sufficiently large conductor p”*', we have 


5B, wy #0 mod. 


We can choose an integer m such that all primes of F,, above | remain 
prime in F,,, and we take m > my. Letn > m. 
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Suppose that $B, ,, = 0 mod &. Since 2 is the only prime of F,, lying 
above its restriction to F,,9, we conclude that 


Th, m(2¥(a)~ *By, zy) = 0 mod 2 


for all « € ZF. 
We shall now transform this congruence into more explicit terms. 


Lemma 4.3. If 3B, = 9 mod @ for infinitely many W (so of arbitrarily 


large conductor p"*'), then there exist infinitely many n such that for such 
W and all a € Z* we have 


Wa) 5 Pt 
d Y DY rxW,-n(an) + rp"-"*") = 0 mod &. 
neR r= 


Proof. Abbreviate T = T,,,,. From the irreducible equation of a p-power 


root of unity, we see at once that T(e) = 0 for any p-power root of unity ¢ 
which does not lie in F,,. Thus if B ¢ Z} and we write 


B = of B)<B>> 


where w is the Teichmuller character and <f>, = 1 mod p, we get: 


T(W(B)) # O<> W(ByP” = 1 
<= <B>," = 1 mod p"*! 
<> ¢B>, = 1 mod p"™"*?. 


Consequently we find 


TRW(a)~*By. yy) = 40" " ¥. xa(aa*) ra 


where the sum is taken for 
0<a<dp"*' and <a>, = <>, mod p"™*?. 
This can be rewritten 


-1 
TE Y(a)~ "By w) = Pes 2 Str, a) 
née 


where 


Sa=  D avava 


a=tya(pr-=™t 
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Note that given ny, elements a satisfying 


n+1 n—-m+1 


O0<a<dp and a=namod p 


can be paired with elements a’ such that 


n+1 n—-mt+1 


0<a'<dp and a’ = —na mod p 


of the form 


a’ = dp"*' — a. 
Since y is odd and w is even, we therefore find 


S(q, &) = ~ — [2xlaW(a)a — (a)(a) dp"*"*). 
) 


a=na(pr-mt! 


But the integers a satisfying the above conditions are precisely those of the 
form 


Sp—-m(an) + rp"-™** withO <r <dp™— 1. 
This concludes the proof of the lemma. 
To prove Theorem 4.1, we still have to deal with the possibility that 7B, ,y 


has a pole at &. However, we note that & has finite ramification index over I. 
Consequently for each y there is only a finite number of y such that 


1 
2Bi wy 


has a pole at &, because such terms contribute negative /-order to the class 
number h,, which can be cancelled only by a finite number of other factors 
in light of the congruence 


3B, wy #0 mod Lv 
proved for all but a finite number of yw. This concludes the proof of the 


assertion in Theorem 4.1 that in fact, all but a finite number of 5B,, »y are 
l-units. 
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In this chapter we prove that the Iwasawa congruences cannot be satisfied, 
thus giving a bound for the divisibility of Bernoulli numbers with characters, 
and hence a bound for the divisibility of the corresponding class numbers 
with respect to certain primes. 

The proofs closely follow Ferrero—- Washington [Fe-W] and Washington 
[Wa 2], except that Gillard [Gi 2] gave a simplification which we take into 
account. 


§1. Basic Lemma and Applications 


The impossibility of the congruences derived in the preceding chapter will 
follow from the next lemma, valid for some choice of representatives @ for 
B,-1 (mod +1). 


Lemma 1.1. Let d,mbe positive integers with d prime to p. For alln sufficiently 
large, there exists %,,a, € Z,,witha,,a, = 1 mod p™,and anelement ny € Z, 
having the following properties. 


(i) sy(%17) = S,—n(%1n) = 0 mod d for allne BZ; 
(ii) 54(%21) = Sp—m(%2") = Omod d for all n # No; 
(iii) 5,(%2o) = Sn—m(%2No) + p" ™*' =O mod d. 


The proof of this lemma will be given in the next sections. 

Although Ferrero—-Washington used similar lemmas for their theorems, 
Gillard [Gi 2] observed that the above single statement suffices in all cases. 
We now show its applications in each of the three cases under consideration. 
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The congruences of Theorem 2.4 of Chapter 10. We recall that these con- 
gruences are: 


(1) >, t,(a4)n” mod p is independent of « and n. 


ne 


In this case, we can take m = d = 1, and the congruences mod d become 
irrelevant, so that 


t(%19) = ty(%2n) =O for ally 4 19; 
tino) =O and t,(a2%) = 1. 


Subtracting the corresponding expressions in the congruences yields the 
contradiction. 


The congruences of Theorem 2.5 of Chapter 10. We recall that these con- 
gruences are: 


(2) ». 3 izx(s,(an) + ip"*!) = 0 mod p. 


ne& i=0 


The character x is odd, of conductor d or dp, and (d, p) = 1, while d 4 1. 
We take m = 2 in Lemma 1.1. Then we obtain for 7 4 no: 


5,(%19) = 9 = s,(%.9) mod p 
S,(%1n) = 0 = s,(%.9) mod d, 


and hence s ae = s,(a,y) mod dp. Since y has conductor d or dp, we get 
fori = 0,. 


X(s,(01n) + ip"*') = x(s,(a29) + ip”*?). 


Similarly, 


Sa(%2No) = Sx(%1N9) — p"*' mod dp, 
Let a = s,(@,N9). Then the congruence (2) yields 


d-1 d-1 


Y ix(at+ip"*!) = 2 ig(at+(i—1)p"*+) mod p. 


i=0 i=0 
But 
d-1 
X y(at+ip"*!) =0 
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because the conductor of x does not divide p"*'. Hence 
d-1 d-1 
YG + Dyx(atip"*') = ¥Y ix(a+(i—1)p"**) mod p. 
i=0 i=0 
Subtracting yields 
dy(a+(d—1)p"*') = 0 mod p. 


This is a contradiction because a+(d—1)p"*' is prime to dp. Indeed, it is 
obviously prime to p because it is =7) mod p, and on the other hand, since 
a = 0 mod d, we get 


a+(d—1)p"*! = —p"*! mod d. 
This concludes the proof. 


The congruences of Lemma 4.3 of Chapter 10. We recall that these con- 
gruences are: 


dp™-1 


1 
(3) a YY rxWs,-m(an) + rp"~"* *) = 0 mod 8. 


neR r=0 
For « = a, or % = &, we get forallne &: 
n+1 


a *(s,(an)+rp"-"*!) = y+rp""™** mod p 


Let a = s,(%M9). Arguing as in the preceding case, we get 


q dp™- 
qb, ilar ip’ Wo + ip" **) 
dp™-1 
= 9 YX ixat+(i-Vp""* (no + i— Dp" "**) mod &. 
i=0 


Again changing i to i+ 1 as in the preceding case, we get 
1 
apna + dp" —1)p"-"* Wo + (dp — 1)p""*") = 0 mod &. 


But a+(dp"—1)p"~"*' is prime to dp and no+(dp™—1)p""™*! is prime 
to p, so we get the desired contradiction. 
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§2. Equidistribution and Normal Families 


We recall some facts about equidistribution of sequences on R/Z. Let ¥ 
be a family of Riemann integrable functions of R/Z. Let CCF) be the vector 
space generated by ¥. We say that C( ¥ > is Riemann dense (in the space of 
all Riemann integrable functions) if given ¢ and given a real Riemann in- 
tegrable function g, there exist real functions f,, f, € C<¥) such that 


fAisash 


and 
I (hh —fi) <e. 
R/Z 


(In other words, we can approximate g above and below by functions from 
CF.) 
Let {y,} be a sequence of elements in R/Z. Consider the condition: 


EQU. Let ¥ be a family of (complex valued) Riemann-integrable functions 
on R/Z such that the vector space generated by ¥ is Riemann dense. Then 
for every function f in F, we have 


1 
FG) dx = lim = (f01) +--+ £On)). 
N-> oo 


R/Z 


By a three epsilon argument, one sees that if the sequence satisfies EQU 
for one family F, then it satisfies EQU for every such family. If that is the 
case, we say that the sequence is equidistributed, or uniformly distributed. 
Examples of such families which we shall use are as follows. 

The most classical family is the family of characteristic functions of 
ingervals [a, b) contained in [0, 1). Then equidistribution means that the 
density of n such that y, lies in [a, b) exists and is equal to the measure of 
[a, b). In other words, 


1 
lim — (number of n < N such that y, € [a, b)) = b—a. 


N>o 


In the application we deal with an even more restricted family, when the 
end points a, b of the intervals satisfy additional restrictions (rational 
numbers whose denominators are powers of a prime p), but the sequence still 
satisfies EQU with respect to this family. 

We shall also deal with the family of characters on R/Z, i.e., functions of 
type e?"'"’, which satisfies EQU. Then EQU is known as Weyl’s criterion. 
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The above criteria apply mutatis mutandis to r-space R'/Z’, using cubic 
boxes instead of intervals. 

Let {B,,...,8,} be a family of p-adic integers. The following two con- 
ditions are equivalent, and define what it means for this family to be normal. 


NOR 1. For every positive integer k and every r x k matrix (c;;) of integers 
with 0 < c;; < p—1, there exists n > —1 such that 


tn+ (Bi) = Cij 


fori=1,...,randj =1,...,k, and in fact the asymptotic density of such 


nis p-™. 


The condition means that every possible block of coefficients appears in 
the p-adic expansions of 8,,..., 8, with the expected frequency. 


NOR 2. The sequence 


(a nt+1 Sn(B1), - te 56.) n=1,2,... 


is uniformly distributed mod Z’. 


We shall now prove that these two conditions are equivalent. 
Let BeZ, and let C=(c,...,c¢,) be a k-tuple of integers with 


Denote the principal part 
c ¢ 
Pri(C) = + ep, 


Let I,(C) be the interval of real numbers [a, a + 1/p*) where a = Pr,(C). 
Write 


s,(B) = Sn—K(B) = oucteape 8 pane 2,D". 


Define the principal part 


Pr, (B) = a (5B) Om ()) eee 
p Pp 
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Then one verifies at once that 


: ; 1 
Pr,, (8) = Pr,(C) if and only if Pam Sa(B) € 1,(C). 


Applying this criterion to an r-tuple 6,,..., B,, we see that if the sequence 
in NOR 2 is equidistributed, then {f,,..., 8,} satisfies NOR 1. Conversely, 
we also see from the above criterion that if {8,,..., B,} satisfies NOR 1, then 
that sequence is equidistributed over intervals of type [a, a + 1/p*), with 
a = Pr,(C) as above. We can then apply EQU, with the family of character- 
istic functions of such intervals. 


Remark. As a special case, we also see that 1,(8) depends only on the 
interval [a/p, (a+ 1)/p) in which p~*"s,(B) lies. This remark will be used 
in the applications. For instance if p~"*"'s,(B) lies in the interval [0, 1/p), 
then t,(f8) = 0. If it lies in the interval [1/p, 2/p) then t,(B) = 1, and so forth. 


We use Haar measure on Z, (normalized to have total measure 1), and 
the expression “almost all” refers to all elements except on a set of measure 0 
for that measure. 


Lemma 2.1. Let {B,, ..., B,} be elements of Z, which are linearly inde- 
pendent over the rationals. Then for almost all xe Z, the family {aB,,..., «B,} 
is normal. 


Proof. By Weyl’s criterion, we must show that for every r-tuple of integers 
(a,,...,a,) not all 0, and almost all «, we have 


: 1 N r 1 
lim — > ( > wt s(aBa =), 
i=1 


No n=1 


where e(x) = e?*'*. (For each r-tuple we exclude a set of measure 0, but there 
are only countably many r-tuples.) Let 


r 


B= » a; B;. 


=1 
Since 
sap) = a8 = 3. s,(aB,)a, mod p"*? 


i=1 
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it suffices to show that 


: y (on sc) =0 


lim — 
noo N y= 
for almost all «. Let 
sin, a) = 1 ¥ of 1 sap) 
’ — nN H° pret 


Then writing |S(N, «)|? = S(N, «)S(N, «) we find: 


1 
+3%¥ [ (cross terms) da 
Zp 


1 
2 aed oe 
J, '8ev, 0 eee 


because the integral of the cross terms is equal to 0, since the integral of a 
non-trivial character over the group Z, is equal to 0. (In this case, the integral 


is a sum of roots of unity.) Thus we obtain 


0 


DY J1Scn?, a)? da= Ys 
m=1 m 


m=1 
By Fubini’s theorem, we can put the summation sign on the left inside the 
integral, and thus conclude that 
lim S(m?, «) = 0 
mo 
for almost all «. 
For arbitrary N, choose m such that m? < N < (m+1)’. Trivial estimates 


show that 


2 
|S(N, a) | < |S(m?, «)| + 70 as N — oo, 


This concludes the proof. 
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§3. An Approximation Lemma 


Lemma 3.1. Let {f,,...,8,} be p-adic integers, linearly independent over 
the rationals. Suppose we are given € > 0; an integer m > 0; an integer d 
with (p, d) = 1; real numbers x,,..., x, € (0, 1). Then for all n sufficiently 
large, there exists « € Z, satisfying : 


(i) a = 1 mod p”; 
(ii) |p~"* Ys, (aB;) — x;| < € fori = 1,...,r; 
(iii) s,(¢B;) = O mod d fori = 1,...,r. 


Proof. We use vector notation and put x = (x;,...,x,), B = (B,,..., B,). 
We let || || be the sup norm on the torus R'/Z’. For each n we define the 
residue 


res,(B) ae pre (s,(B 1), Mii ae 3 5,(B,))- 


We may assume that ¢ is so small that the intervals [x; — e, x; + e] are 
contained in the open interval (0, 1) for all i. Select N sufficiently large so that 
1/N < e/2d. By Lemma 2.1, for each r-tuple 


k = (ky,...,k,) 


of integers k; with 0 < k; < N—1 we can find a p-adic integer «, and an 
integer n, such that 


res, (%,B) — « | < é/2d. 


Let np = m + max, n, and let n > no. 
There exists some k such that 


| : a rs,(5) _ bd < é/2d. 


Let a’ = (1/d) + p"~™a,. Then «’ = 1/d mod p”, and 


res(5) + res,,(o% B) — reso) = 0. 
Hence the above inequalities yield 
| ; 3 reso) <2e/2d = e/d. 


276 


§4. Proof of the Basic Lemma 


It follows that 


X;—-& X;+é 
—(n+1) 'B. ee es ays 
Pp sieBde | d > d 


for alli = 1,...,r. 
Finally we let « = da’. Then s,(«B;) = ds,(a’B,), and satisfies the required 
conditions. 


The above proof, considerably simpler than the original proof, is taken 
from the Bourbaki report by Oesterle (Bourbaki Seminar, February 1979). 


§4. Proof of the Basic Lemma 


This section contains the proof of Lemma 1.1. 
Let n' be a primitive (p—1)th root of unity, and let its powers be 


. —1 
n’ forj=1,...,R where R=Pe. 


Then these powers 1/ represent the elements of p,_,/+1. 
We shall write y,,...,7, for +n',..., +n’, with any choice of sign, and 
r = o(p—1). We can express 


ths DY ani forj=r+1,...,R, 
i=1 


with integral coefficients a;,. Thus we obtain an (R—r) x r matrix 
A=(aj) (r+l1<j<R; 1<i<n). 


We let x,,..., x, be real numbers, and we then let 
e 
x)= Yagx, forj=r+1,...,R. 
i=1 


Observe that changing the signs of 7,,...,7, amounts to changing the 
signs of the columns in the matrix A. Changing the signs of ,41,...,1p 
amounts to changing the signs of the rows. Such changes of signs will be 
called admissible. 

Since n,/y; is not rational for i ¥ j, it follows that in each row, there are at 
least two non-zero elements. Furthermore, it is clear that no two rows of the 
matrix A are equal. We now prove: 
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Lemma 4.1. Let A = (a;;) be areal matrix such that no row equals + or — 
another, and in every row there are at least two non-zero elements. After an 
admissible change of sign, we can find a vector (x,,...,X,) ='XeR" 
such that, if we put 


RENE AM. Gad? KOH WS ess ccc), 
then: 


(i) We have x; > O for j = 1,...,R; 
(ii) We have x; # xj, for all j # j’. 


Proof. In R-space, we consider the conditions: 


x; = 0 for some j = 1,...,R; 
xX; — xX; = Oor x; + x; = 0 for some pair (j, j’) with j 4 j’. 


Each such condition defines a hyperplane. We want some X™ € R' such that 
(Xx, AX") 


does not lie in the union of these hyperplanes. Let V be the vector space of 
all vectors (X”, AX), ie. the graph of the linear map represented by A. 
Then V is not contained in any one of the above hyperplanes because of the 
two assumptions on the matrix 4. Hence V is not contained in the finite 
union of these hyperplanes. Let V’ be the complement of these hyperplanes in 
V, and let VG be the projection of V’ on the first r coordinates. Take (x,,..., x,) 
in the positive 2’-quadrant intersected with Vo, and let (x,,..., Xp) be a point 
in V’ above it. Since V is symmetric with respect to sign changes on the last 
R —r coordinates, we can then make such sign changes on x,,;,...,Xp to 
achieve the desired positivity condition. (I am indebted to Roger Howe for 
the above proof.) 


We let jo be the index such that x; < x,, for all j 4 jy. After replacing 
X1,.++5Xp by CX,,..., Xp for some real number c > 0, we may assume that 


O<x,;<p” forallj=1,...,R. 


We then apply Lemma 3.1, (i) and (ii) with B; = n; fori = 1,...,r. Letn and 
# be as in that lemma. If ¢ is small enough, we obtain 


OS aap sony <p ™ forzj rt... R: 
i=1 
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Hence 

0 < } ays,(an,) < p"**~™ < p"*?. 
Therefore 

Sn—m(ONj) = S,(0;) = x Aji S,(%N;). 


We take «, = « to satisfy the first part of Lemma 1.1. 
For Lemma 1.1 (ii) and (iii), we select the scaling factor c > 0 such that 


O<x,<p”™ forj#jo but p™<x;,,< 2p” 


We select « = «, in Lemma 3.1. Then the desired conditions are satisfied 
for j # jo as before, but for jg, Lemma 3.1 now shows 
n-m+1 


Sn—m(ON jo) = S(%Nj.) — DP 


This concludes the proof. 
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In Chapter 4 we developed the formalism of associated power series for 
measures on Z,,. It is necessary to develop it in general. We do this in the 
present chapter, which could (and should) have been done immediately 
following Chapter 4. In all this work, a prime p is given a special role. Values 
of functions lie in C,. In dealing with this composite case, it is also useful to 
follow Katz, and associate to a measure not only a power series, but an 
analytic function on the “formal multiplicative group.” This is explained in 
§2. The introduction of additional notation to handle this composite case, 
however, made it worthwhile to separate the two cases. Measures on Z, 
itself, without the extra d, occur both in their own right, and as auxiliaries 
to the composite case, so it is useful to have their properties tabulated 
separately. 

The present chapter is independent of everything else in the book, and 
can be omitted by those who wish to read at once the results of Chapter 13, 
needed to bound the plus part of the class number in terms of the minus part, 
for the Ferrero—Washington theorems. 


§1. Measures and Power Series in the Composite Case 


In Chapter 4, we dealt only with the formalism of measures and power 
series on Z,. To handle characters with conductor dp" where d is a positive 
integer prime to p, one has to deal with the composite case. Thus we now give 
an exposition of the formalism in this more general context. 

Let Z be a profinite group, equal to the projective limit of its quotients 


Z = lim Z/H, 
H 
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where H ranges over the open subgroups of finite index. Let 0 be a complete 
valuation subring of the p-integers in C,. We let the Iwasawa algebra be 


A,(Z) = lim 0 [Z/H], 
H 


where o[Z/H] is the group ring of Z/H over o. We recall that an o-valued 
measure on the projective system {Z/H} is a family of o-valued functions 
{Un}, Which is a distribution. This means: given H’ < H, we have 


Hy(x) = Sy Hy (y), 


where x € Z/H and the sum is taken over y € Z/H’ lying above x under the 
canonical map Z/H’ — Z/H. The association 


Myt> >) by(x)x € 0o[Z/H] 


xeZ/H 


lifts to an isomorphism between the additive group of o-valued measures and 
the Iwasawa algebra. 

Observe that the product in A,(Z) corresponds to the convolution of 
measures. 

If gis an v-valued function on Z, factoring through some factor group Z/H, 
in other words, if @ is a step function, then we define its integral 


fo du= DY 9(x)up(x). 


xeZ/H 


This value is independent of the choice of H, by the distribution relation. The 
integral extends to the space of o-valued continuous functions on Z by 
uniform approximation with step functions. 

Let C(Z, 0) be the 0-space of continuous functions of Z into o, with sup 
norm. There is a bijection between o-valued measures on Z and v-valued 
o-linear functionals 


A: C(Z, 0) > 0. 


Indeed, the measure yp gives rise to the functional 
Qr i; @ du. 


Conversely, given A, if x € Z/H and 9, is the characteristic function of the set 
of all y € Z such that y = x mod H, then we define 


Hy(x) > Mx). 
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It is easily verified that these associations are inverse to each other, and 
establish a norm-preserving bijection, where 


Hi] = sup | uy(x)| 


x,H 


is the sup norm. 
Now suppose that there is a finite subgroup Zp such that 


Z=Z, x Z,. 


Let F be a multiplicative group isomorphic to Z,, with topological generator 
y, so the isomorphism is given by 


zery’, with ze Z,. 
Let H, = {1} x p"Z,, and let y, = y mod T’’”. Then 
Z, = Z/H, = Zo XT, 


where I, = I'/T?” is cyclic, with generator },. 
Let X bea variable, T = 1+ X, and 


h(X) = (1+X)" = 1. 


Then the element of o[ Z,,] corresponding to py, is a linear combination 


(1) P(X)= > oy Pn, aM)OVn 


oeZo r= 


pr-l ir 


i de ane, 0, Jox « mod h,(X). 


aeZo r=0 k=0 


where y, = T mod h,. For each o, the map 

rE Mn, oN) 
determines a measure , on Z, as discussed in Chapter 4. This makes it 
possible to extend results concerning measures on Z, to measures on the more 


general groups now being considered. If we let 


pr-1 


P(X) = 2 Cn, pas 
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then the coefficients 


(2) Chk = >: > nl, )o = > Cn, ,(0)o 


lie in the group ring o[Z]. 

We let P(X) be the projective limit of the polynomials P,(X). Then P(X) 
is an element of o[Z,|[[_X]], and we shall write the correspondence between 
and the power series P by 


f=PH w= py. 


Let us write 


P(X) = Ya xt=Y VY qoXt 
k=0 


k=0 ceZo 


Then the coefficients c,_, are given by the integrals 


Q) cue (f) dets 


because we can apply Theorem 1.1 of Chapter 4. 
Let @ be a continuous function on Z. Then for each o, we get a function 


9(r) = 9(6,r) withreZ,. 


If g factors through Z,,, then 


(4) [ odu=¥ ¥Y o0,nu,(0,7) 


o reZ(p") 


=> i Po Ag - 
Z 


aeZo P 


Any continuous function @ on Z can be viewed as a family of continuous 
functions {~,} on Z,. Hence 


(5) [¢ du= ¥ [% du,. 


aéZo 


Using Mahler’s Theorem 1.3 of Chapter 4, if we write 


x 
Q(x) = = an(*) 
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and the power series associated with p, is 


(6) SAX) a Spek 


then 
(7) [ode = DE enotne 
Z oon 


The sum formula (5) in principle allows us to reduce the study of any 
measure p, to the individual measures associated with the power series f, 
on the fiber {o} x Z,. The formulas of Chapter 4 apply to each f,. In addition, 
we have trivial ones relating to the extra factor Z). For instance: 

(8) If @ is a function on Zo, then 
Ply = Hy where g, = 9(9)f,- 


This applies in particular to the characteristic function @, of a single element 
a € Z,. Then 


Palsy = Uy, 


where f, is identified with the element f,-« in Z,[Zo][[X]]. In this last 
example, we have of course the Fourier expansion of @, given by 


l 


Q, = 1Zl d Way, 


where the sum is taken over all characters w of Z. 
At any finite level, that is on any one of the groups 


Zo x Zp"), 
the space of functions is generated by the product functions 
Po @ Pp, 
where @p is a function on Z, and @, is a function on Z(p"). By definition, 


(Po ® Pp)(G, Xp) = Po(F),(Xp). 
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Thus to test whether two measures are equal, it suffices to test whether 
their integrals on such functions are equal. These integrals decompose as 
simple sums according to (4) and (5), namely 


(9) [ eotoro,) duo, Xp) = » P(e) i Pp(Xp) du,(x,). 


This applies for instance if g@ is a character of Z which factors through a 
finite level. 

Let f be the power series associated with yw. Given the function @, on Zo, 
the expression of (9) defines a measure py, on Z, whose associated power 
series is 


foo = 2 Pol) fo, while Hp, = Y) Po()He- 


In other words, we have the formula 


(10) [ @ Oy dus = ) ~,du, where g = f,,. 
Zz Zp 


If Z) = Z(d) = Z/dZ, then we may take for @, an additive character, 
which is of the form 


Wor Xor> 0°, Xo € Z(d), 


for some d-th root of unity Cy. In that case, the measure p, above will also be 
denoted by 


Hy OF  Myg 


if Wo is the above character. 
Let N = dp". An N-th root of unity ¢ has a unique product decomposition 


C= Cove 


where Cy is a d-th root of unity, and C, is a p"-th root of unity. Then we have, 
by definition and (10): 


(11) iF CxeCE—p (xp) dulx) = [ C3009 x») dig (%,p). 
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In particular, for g, = 1, 


(12) [coe nee [ C2 deal Xp) 


where 4;, is the measure whose associated power series is 


(13) foo= YL OSfe0: 


xo € Z(d) 


For this formalism it is therefore useful to define x € Z by its two components, 
X = (Xo, Xp), where x9 = x mod d, and x, = x mod p”. Then 


om CBee, 


so that the above integral can be written more simply with ¢*. 
By the orthogonality of characters, it is then immediate that we can 
recover f,, from f;, by means of the formula 


1 
(14) fay = G50 


§2. The Associated Analytic Function on the 
Formal Multiplicative Group 


Katz [Ka 3] has shown that in addition to the power series associated with a 
measure, it is also useful to associate an analytic function on the “formal 
multiplicative group.” The formalism is similar to the one of power series 
developed in Chapter 4, but it is more convenient in some situations, 
especially when dealing with the extra factor Z(d) in the composite case. 
Again, Katz’s formalism makes certain constructions due to Iwasawa and 
Leopoldt appear completely natural, and we reproduce this formalism 
below. 

We fix a positive integer d prime to p. In the sequel we let N denote any 
positive integer of the form dp", and again we let Z = Z(d) x Z,, so that 
Zo = Z(d). Let 


T=1+xX 


as usual. Let m be the maximal ideal in o¢,. A function R on 1 + mis called 
analytic if there exists a power series f with coefficients in C,, converging 
on m, such that 


RG +z) = f(z) forallzem. 


286 


§2. The Associated Analytic Function on the Formal Multiplicative Group 


Let y be a d-th root of unity. We say that R is analytic on n(1 + m) if there 
exists a power series fas above such that 


R(y(i + z)) = f(z) forall zem. 
Then fis uniquely determined by R, and is said to be associated with R. 
Example. The function log T = log(1 + X) is analytic in the above sense. 
In the applications, we shall deal principally with this log, or with rational 


functions. 


We let G,(d) be the (disjoint) union 
G,(d) = LU ni +m). 
n@=1 


A function R is called analytic on G,(d) if it is analytic on each “component” 
n(1+m). Then R consists of a family {R,} where each R,, is analytic on the 
corresponding component. If ¢ € y(1-+m), we use the notation 


C= mu, n = Cy = a), u=(,= <a a 
Then 
R(Q) = Rag = Sols 1). 


We shall call G,(d) the formal multiplicative group (at p, of level @). 


Remark. The group G,(d) may be viewed as the group of (continuous) 
C}-valued characters on Z = Z(d) x Z,, by the mapping 


Wr: xr> Cr = coGe 
for each € € G,(d). 
Let be a measure on Z, with values in v. An analytic function R on G,(d) 


will be said to be associated with y if the power series associated with R have 
coefficients in 0, and if we have the formula 


[ C dux) = RO 


for all N-th roots of unity ¢, N = dp” (all n). The Weierstrass preparation 
theorem shows that an associated analytic function R, if it exists, is uniquely 
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determined by yu. Of course, we want the above formula to be true also for all 
¢ € G,(d), but we can prove it later, when ¢ is not a root of unity. 

We shall say that y is rational if R is a rational function of T (and hence 
of X). Thus the right-hand side R({) is just the value of this rational function 
at T = ¢. We then call R the associated rational function. We also observe 
that the rational function is the same (if it exists) as that associated with the 
restriction of uz to 


{0} x Z,. 


Indeed, we have from the definition of a distribution: 
RG) = | cpr dua) = | o5 dul, xp. 
Zz P 


R 1. A measure always has an associated analytic function R. If d = 1 and 
Lt is a measure on Z,, then 


R(T) = f(X) 
is the associated power series of Chapter 4. 


Proof. For a measure on Z, with d = 1, this follows from Meas 0 and 
Meas 2 of Chapter 4, §2. The general case then follows at once. Note that in 
Meas 0, we have 


F(0) = R(). 
Furthermore, if is a measure on Z = Z(d) x Z,, then the measure p;, 
at the end of the last section is a measure on Z, to which we can apply R 1, 


with d = 1. 


R 2. If a measure u on Z has an associated rational function R(T), then the 
measure lz, has an associated rational function, given by 


R(T) = RCoT) = fro X). 


Proof. This is immediate from the definition of the associated rational 
function, and the integral formula (12) of §1. 


Lemma 2.1. Assume that pu is a measure on Z such that each py, is rational, 
for every d-th root of unity €). Assume also that the functions 


R;(6o 'T) = R(T) 
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are independent of Cy. Then jis rational, and its associated rational function 
is R(T). 


Proof. This is again immediate from the integral formula (12) of §1. 


R 3. Let pt be a measure having associated analytic function R. Then the 
formula 


[eo du(x) = RO 


holds for any C € G,(d). 


Proof. This is a direct consequence of R 1, formula (12) of §1, and the 
analogous result for measures on Z, stated in Theorem 1.2 of Chapter 4. 


R 4. Let € € G,(d). Let R be the associated analytic function of pu. Then 
€*u(x) has associated analytic function R(CT). 
Proof. Immediately from R 3, since (7¢3 = (¢,¢,)*. 
Just as with associated power series, R 4 allows us to use the Fourier 
expansion of any step function to obtain its associated analytic function. 


Thus let @ be a step function of level M which is a divisor of dp" (possibly 
a pure power of p). Then 


x)= Y HOC 
eMe=l 


and 


1 3 
90 = 79 D 9K 


xeZ(M) 


We shall now give a first example of the use of such expansions in connection 
with the unitization operator U defined by the formula 


UR(T) = R(T) — Z > R(CT). 
Pc=i 


Let 
Z* = Z(d) x Z*. 
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R 5. Let @ be the characteristic function of Z*. If R is associated with p, 
then 


UR is associated with ou. 


Proof. The Fourier expansion of the characteristic function of Z¥ was 
already computed in Chapter 4, §2 and is trivially determined to be given by 


-i/p ifC #1. 
$0) =4 5 

Dee ees 

P 


and ¢ ranges over p-th roots of unity. Property R 5 then follows from R 4 and 


the Fourier expansion. 


R 6. Let N = dp" wheren > 0. Let x be a Dirichlet character of conductor N, 
and let € be a primitive N-th root of unity. If R is the analytic function 
associated with p, then the analytic function associated with xp is 


SD aRE-T) 
ae Z(N)* 


where 


Sx, = ¥ aye. 


Proof. The computation of the Fourier transform of x is routine and is 
left to the reader. (One has of course to use Theorem 1.1 of Chapter 3, §1.) 


R7. Let R be the analytic function associated with p. Let D = TDy and 
let k be an integer =>0. Then 


x‘ u(x) has associated analytic function D‘R(T). 


The same statement holds if yu is rational, and “analytic”’ is replaced by 
“rational” in the above statement. 


Proof. We may use 9,(x,) = x* in (11) of §1, and then apply Meas 6 of 
Chapter 4, §2 to each one of the measures y/,,, after using R 1. Thus the general 
case is reduced to the special case of measures on Z,. 
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R8. Let R = UR be the associated analytic function of a measure p with 
support in Z*. Then the measure 


xp 'u(x) has associated analytic function UH, 
where H is any analytic function on G,(d) such that 
DH = R. 


Proof. Since x, ‘ (x) is a measure on Z*, there exists an analytic function F 
associated with it. Then R 7 yields 


DF = R. 


We can let H = F. Since the kernel of D consists of the functions which are 
constant on each component of G,(d), if we select any H such that DH = R, 
then UH will have the same value, independent of the choice of H, thus 
proving the desired property. 


The “integration” of the analytic function R(T) can be performed formally 
in terms of the variable X. Indeed, say on the coset 1+, if R is defined by a 
power series f(X) then an analytic function H such that DH = R is given in 
terms of X by 


dX F(X) 


B= fron Sy tx ote 


dX = fan. 


Remark. The formalism of this section is set up to extend at once to its 
adelization over d prime to p. 


§3. Computation of L, (1, x) in the Composite Case 


Let E, , be the measure defined in Chapter 10, §2, giving rise to the p-adic L- 
function and the Bernoulli distribution, regularized with c so as to be integral 
valued. We shall apply the previous considerations to this measure. 

By definition, we have for s = 0 the value 


-1 
L,(, ) =———-~ ae ; 
Wh) = pes | adap AE, a) 
The conductor of y being dp" = N, the integral might as well be taken over 
Z** = Z(d)* x ZF. 
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In any case, if we find (as we shall do below) that E, , has an associated 
rational function, then the general formalism also yields successively the 
corresponding analytic function for the measure 


Maja, * Ey, (a). 


We may then evaluate this analytic function at T = 1 to get the value L,(1, x). 
We shall now carry out on Z(d) x Z, the same analysis that we did for 
Z, in Chapter 4, concerning the associated power series, and the Leopoldt 
formula for the value of the L-function at s = 1. 
We recall that if c is a positive integer prime to dp, then the measure 
E,,, on Z, has an associated rational function (equal to its associated power 
series by R 1), which is 


1 c 
R, {T)=—— - 
14 ) Fs | Te —1 


according to Proposition 3.4 of Chapter 4. 


Proposition 3.1. Let c be a positive integer prime to dp. Then E,,, on 
Z(d) x Z, has an associated rational function, equal to R,, (T) above. 


Proof. To extend the above result from Z, to Z(d) x Z,, it suffices to 
prove that for every root of unity ¢ € py, and N = dp", we have 


{ C dEy (x) = Ry, AO. 
Z 


By definition, essentially (cf. formula B 6 of Chapter 2, §2) we know that for 
any function g on Z(N) we have 


=1 Ze% 
X00) 


a) Z N 
Kh 1226 


We apply this to g(x) = C* and g(x) = ¢‘*. Summing a geometric series from 
0 to N—1 then yields 


Al 1 ze 1 c 
=B,. —=—B, ...) —— = eS 
Py (; i eae ad G20! fel. ere’ 


where T = e”. The right-hand side at T = 1 is the same as R,, ,(£), and is also 
the same as the left-hand side at Z = 0. But that is precisely the value of the 
desired integral for k = 1. This proves Proposition 3.1. 
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Proposition 3.2. Let x have conductor N = dp" with n> 0. Then xE,,, 
has an associated rational function R,,, given by the formula 


R,, AT) = G(T) — cx(c)G,(T*), 
where for any primitive N-th root of unity ¢, 


800 << 
GT) = N xh CT 1° 
Proof. Special case of R 6. 


We can write R, .(T) in full in the form: 


R, (T) = Sa.) 6) Bz zal (25 | 


5.08 T — c 


This puts us in the position of applying R 8, and of finding an analytic function 
H,,, such that H, (1) = 0, and 


We let 4 range over c-th roots of unity, and we let 


eC eee Y Y © log = 


N A#1 aeZ(N)* —Att 


Exactly the same verification as in Chapter 4, §3 (or a direct integration using 
partial fractions) shows that this value for H, ,(T) satisfies our requirements. 

By R 6, the analytic function associated with the measure y(a)a > Ey, .(a)is 
UH, ,(T). 


Theorem 3.3. Let x have conductor N = dp", n > 0. Let € be a primitive 
N-th root of unity. Then 


Lyfts x) = ~ (1 = ME) SD Y Ha) log (1 — 6). 


N aeZ(N)* 
Proof. Let € range over the p-th roots of unity. By R 6, 


— 1 —x(©))L,C, x) = UH,, (1). 
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Hence, following exactly the proof of Chapter 4, Theorem 3.6: 


1 va 
ae ad Pe x(c))L,C, ya) = P te a 2 » > x(a) woe, = 


a A#] 
1 S(x% 9) g— ACP 
- 158.99 yom (1) 11 =) 
But 
E-Ate ae 
am I 1—Ags o (1—AC*)P 


If p| N, then as in Theorem 3.6 of Chapter 4 we find that 


> Xa) log,(1—Ac*?) = 0, 


and the rest of the proof is identical with the previous one. If p/N, then we 
change variables, letting a+ p- ‘a mod N. We then find 


Y Ha) log 1 —Al*”) = x(p) Y x(a) log, (1 — AL"). 


Thus we get an extra term besides that of Chapter 4, which gives rise to the 
factor (1 — y(p)/p) as stated in the theorem. Except for that, the proof is again 
identical with the previous one. 
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Classical results of Kummer give bounds for certain class numbers and 
estimate, for instance, the plus part in terms of the minus part for cyclotomic 
fields. 

Such estimates have been carried out more systematically and generally 
by Iwasawa, to include estimates for his invariants, and will be given in the 
first two sections of this chapter. 

Next we deal with the /-part of the class number in a cyclic extension of 
degree p” when p is a prime #1. We also give examples of Iwasawa of non- 
cyclotomic Z,-extensions when the order of the class number grows expo- 
nentially. These examples are based on a classical formula (Takagi- 
Chevalley) expressing the fact that in a highly ramified extension, the ramified 
primes have a strong tendency to generate independent ideal classes. 

We conclude with a lemma of Kummer which is still somewhat isolated. 
In this connection, cf. [Wa 5]. 


§1. Iwasawa Invariants in Z,-extensions 


Let K be a number field and K,, a Z,-extension. We let K, denote the sub- 
field of degree p" over K,so Ky = K. 

Let C,, be the p-primary part of the ideal class group of K,,. We also write 
C(K) for the p-primary part of the ideal class group of K. Then by Iwasawa 
theory of Chapter 5, we have 


|C,| = p*” where e, = mp" + An + O(1). 


We call m, A the Iwasawa invariants of the Z,-extension. We call m the 
exponential invariant, and / the linear invariant. We indicate the dependence 
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of m and / on the Z,-extension by the notation. 
m=m(K,/K) and 4= A(K,,/K). 


If K,, is the cyclotomic Z,-extension, then these invariants depend only on 
K. We may then write more briefly 


m=m(K) and A= A(K). 
Put 
Vz = C(K.,/K) = lim C(K,). 


From the structure theorem of Chapter 5, we have a quasi-isomorphism 


Va ~ I] A/p™ ® Il Al fj. 


For any abelian group V we let V“” be its p-primary part, i.e. the subgroup of 
elements annihilated by a power of p. Then we have quasi-isomorphisms 


Ve ~ [][ A/p™ and V/V ~ [] A/f;. 


From the structure Theorems 1.2 and 1.3 of Chapter 5, we know that 


MK ../K) = ¥ deg fj. 


Furthermore, we have the characterization: 


A is the rank of Vx/V%) as a finitely generated Z,-module. 


For the exponential invariant, we have 


We shall now compare these invariants in a Z,-extension and its lifting over a 
finite extension. 

First note that if F < K and F,, is a Z,-extension of F, then K,, = KF, 
is a Z,-extension of K, and there is some integer r such that 


K, = KF,.,. 
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Lemma 1. Let F < K be number fields. Let F ,, be a Z,-extension of F. 
Then 
mF ,,/F) < m(KF,/K) and A(F,/F) < AUKF,/K). 


Proof. The degree [K,: KF, ] is bounded by a fixed power p’. By class 
field theory, we have 


(CF): Nx,yr,C(Kn)) S LK: Fp’. 
Hence 
IC(K,)| > |C(Fa)I, 
and therefore 
ord, C(K,) => ord, C(F,,) — O()). 


This proves the first inequality in light of the formula for the orders. 
For the second, let us use the functorial notation 


C(F./F) = lim C(F,). 
The norm maps 
Nx,jr, > C(K,) > CCF) 


are compatible with the norm maps in the projective limits in the tower 
over F, and Kg respectively, and thus induce a homomorphism 


N:C(K,,/K) > C(F,/F). 


Since the index of the image of Nx,/-, is bounded for all n (as in the first 
part of the proof), it follows that the image of N in C(F,,/F) is of finite 
index, in other words that N is quasi-surjective. 

Now put V; = C(F,,/F) and similarly for Vg. Let V as usual denote the 
subgroup of elements annihilated by a power of p. Then we have a quasi- 
surjective homomorphism 


Vg/VE) > Ve/VE. 


The inequality for the A-invariants then follows at once since A is the rank 
of the above modules as finitely generated Z,-modules. 
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We have been concerned with the p-divisibility of the order of the ideal 
class group in Z,-extensions. An analogue of Lemma 1 can be given for any 
prime number. 


Lemma 2. Let F,, be a Z,-extension of a number field F. Let K be a finite 
extension of F and let K,, = KF,,. Let | be any prime number. If 
ord,|C(K,)| is bounded, then ord,|C(F,,)| is bounded. 


Proof. The argument using the norm index in the first part of the proof 
of Lemma 1.1 applies equally well to prove the result stated in Lemma 1.2. 


Next we study another Iwasawa invariant, the rank. 
If A is an abelian group, we define the p-rank, 


rank,(A) = dimension of A/pA over the prime field F,. 


We have a criterion for the vanishing of the exponential Iwasawa invariant 
in terms of this p-rank of the groups C,. We phrase the criterion to apply in 
general to torsion modules over the Iwasawa algebra. 

We use the notation of Chapter 5, Theorem 1.3, where we dealt with a 
module of Iwasawa type, and say 


V ~ [[ A/p™ ® [I A/F, 
i=1 j=l 


where the f; are distinguished polynomials. We let 
nVy=r 


be the number of factors of type A/p” for some m. As in Chapter 5, we let 
v1,.-., Vv, be elements of V such that if we put 


U, = Z,-submodule of V generated by (y — 1)V and 1,,..., v,, 
U, = 9,Uo with g, =1t+ y+. + p74, 


then 
Vy, =V/U, 


is finite for alln. We pute, = e(V,) = ord, V,. The proofs of Theorem 1.2 and 
Theorem 1.3 of Chapter 5 distinguish the two cases of modules of type 


A/p™ and A/f 


where f is distinguished. They immediately show the following result. 
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Lemma 3. We have rank, V, = r,;(V)p" + O(1), where r,(V) is the number 
of factors of type A/p™ above. Furthermore, the following conditions are 
equivalent. 


(i) All the factors A/p™ are equal to 0, sor,(V) = 0 and 


V ~ [I] AMS). 


(ii) The p-rank of V, is bounded independently of n. 
(iii) In the order formula e, = mp" + An + O(1), the exponential in- 
variant m is equal to 0. 


§2. CM Fields, Real Subfields, and Rank Inequalities 


We now wish to make a comparison of the behavior of the ideal class group 
in a field and a real subfield. We prove Kummer’s theorem (Theorem 2.2) 
and Kummer type theorems in the following context. 

A number field K is said to be a CM field (complex multiplication field) 
if it is a totally imaginary quadratic extension of a totally real field. We 
leave it as an exercise to prove: 


K is a CM field if and only if the following condition is satisfied. Let p 
be complex conjugation. Then po = op for all embeddings o of K into the 
complex numbers, and K is not real. 


The totally real subfield of K is then uniquely determined, and denoted by 
K*. It also follows that a composite of CM fields is a CM field. 

Although we are primarily interested in the cases when the CM field is 
abelian over the rationals, it is just as easy to deal with the more general 
case. However, we have to restate some results of Chapter 3 in this context. 

Let K bea CM field. Let W, be the group of roots of unity in K. The Hasse 
index Q, can be defined as for abelian fields over the rationals, namely 


Ox = (Ex: Wk Ex), 
where Ex is the group of units in K, while Ex = E(K*) is the group of units 
in the real subfield. 
We have: 


Lemma 1. Q, = 1 or 2. 


The proof given for Theorem 4.1 of Chapter 3 applies here. In fact one verifies 
immediately that the map ut> ii/u gives an injection 


Ex/WEK > Wk/Wr. 
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Lemma 2. Let K be a CM field and p a prime. Let C'(K) be the p-primary 
part of the ideal class group of K. 


(i) The natural map C”(K*) > CK) is injective if p is odd, and its 
kernel has order | or 2 if p = 2. 
(ii) The norm map 


Nxjx+ : C(K) ar a C(K*) 
is surjective. 


Proof. For p odd this is a special case of the similar (trivial) lemma con- 
cerning ideal classes in extensions whose degree is prime to p. The proof 
will be given in full in the next section when we deal with this case for its own 
sake. 

Suppose p = 2. Let a be an ideal of K* and suppose a = (a) with w in K. 
Then &/a is a unit, and in fact a root of unity (the absolute value of all its 
conjugates is 1). We had defined above the map —: Ex > Wx by urrii/u. 
The association at &/« then gives a well defined map 


Ker[C(K*) > C(K)] > Wy/o(Ex), 


which is immediately verified to be injective. Hence the kernel has order 
1 or 2. This proves (i). 

The proof of (ii) is identical with that given for Theorem 4.3 of Chapter 3. 
It is a routine lemma of class field theory. 


The eigenspace C(K)™ is the kernel of the norm map Nx + in C(K), 
and for an odd prime p, we can identify the p-primary parts 


COCK *) as C°)K) fe 
Unless otherwise specified, we continue to assume that p is an odd prime. 


Let K = Ky be a CM field, and let K,, be a Z,-extension such that each 
K,, is a CM field. 


Remark. As Washington observes [Wa 2], if Leopoldt’s conjecture is 
true, then K,, is necessarily the cyclotomic Z,-extension, as follows from 
Theorem 6.2 of Chapter 5. 

The real subfield KZ is a Z,-extension of Kg. It is then clear that 

K, = KK3, 
so K,, is the lifting over K of the Z,-extension K% of Kg. 
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Let C, = C?) be the p-primary part of the ideal class group of K,. We 
have the orders 


- - os 
ord, C, = en ord, C, =e, ord, Cr = en, 


and e, = e, + e,. Similarly, we have the Iwasawa invariants associated 
with the A-modules 


C = lim C,, C” =limcC,, C* =lim Cy, 


and we denote these invariants by m, A with a plus or minus sign as superscript 
corresponding to the two cases. Then 


m=m +m* and A=A> +A". 
Indeed, we have for an odd prime p, 
C=C Or": 
Remark. For p = 2 one has to be more careful, and for instance, one has to 


distinguish C(K*) from its natural image in C(K), which we might denote 
by C*. Cf. Lemma 2, and also Lemma 4.1 below. 


Theorem 2.1. Let p be an odd prime. LetK be aCM field, and assume that 
the p-th roots of unity are in K. Let C be the p-primary part of the ideal class 
group of K. Then 

(i) rank, C* <rank,C™ + 1. 


Let W, be the group of roots of unity in K. If K(W}/”) is ramified over K, 
then 


(ii) rank, C(p)* < rank, C(p)”. 


Proof. Let L be the maximal abelian extension of K of exponent p. Let 
G = Gal(L/K). By class field theory, 


GxC(p) and Gt = C(p)*. 


Since the p-th roots of unity are assumed to be in K, the extension L is a 
Kummer extension. Let B be its Kummer group containing K*?, so that 


K* > B=>K*? and L= K(B"®), 
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We have the exact Kummer duality 
G x B/K*? > 4y,. 
Since p is assumed to be odd, we have a direct product decomposition 
G=Gt*xG. 
For simplicity of notation, let 
V = B/K*?=VtxV-, 


Since p, is a (—1)-eigenspace for complex conjugation, it follows that we 
have an exact pairing 


G* xV >p,, 
and therefore 


rank, G* = rank, C(p)* = rank, V~. 


On the other hand, if b € B we know that K(b!’”) is unramified, and hence 
there exists an ideal b of K such that (b) = b?. The map b+ Cl(b) gives 
rise to a homomorphism 


yo: V = B/K*’? +C,, 
which induces a homomorphism 
gy :V>>C, 


of V~ into C; because g commutes with complex conjugation. One then 
verifies immediately that we obtain an injective map 


Ker pg > Ex(p), 


by writing be B as b=a?u with ue Ex and mapping btu. Since 
(Ex: We Ex) = 1 or 2, we obtain 


Ex(p) = Wx(p)” = Wx(p) 
because Wy = Wx. Therefore 
rank, Wy(p) = 1 = rank, Ex(p) , 


and 


rank, C(p)* = rank, V™ < 1 + rank, C,. 
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This proves the first assertion. Furthermore, if K(W}/”) is ramified over K, 
then B does not contain Wx, and consequently 


Kerg =1. 


Hence the last inequality on the right can be replaced by the stronger in- 
equality 


rank, V™ < rank, C’, 
thus proving the theorem. 
The next two results are really corollaries of the theorem, but we label 
them theorems in their own right in view of their importance. The first is a 


classical result of Kummer. 


Theorem 2.2. Let K = Q(u,), and let h, be the class number of K. If 
h, is prime to p, then h; is prime to p, and so h, is prime to p. 


Proof. Obvious, since the p-rank is 0, and the stronger of the two in- 
equalities applies. 


Let K,, be a Z,-extension of Ky such that each K,, is a CM field. Then 
C, C*, C~ are modules over the Iwasawa algebra, and thus we have the 
invariants 

r(C) =r,(C*) + 7r,(C_) =ri +r;. 
as defined in §1. 


Theorem 2.3. Let K,, be a Z,-extension of Ky (p odd), such that each K, 
isa CM field. Then 


rt <rj- 
In particular, if m~ = 0 then m* = 0. 
Proof. Immediate from Theorem 2.1 and Lemma 3 of §1. 


For the prime p = 2 the estimates are not as good, but one has a result 
of Greenberg (cf. Washington [Wa 2]). 


Proposition 2.4. Let K be a CM field, and C = C(K). Then 
rank, Cx+ < ord,|Cx|+ 1. 
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If K,, is a Z,-extension of Ko such that each K, is a CM field, and if 
m~ =Othenm* = Oalso. 


Proof. Let r = rank, Cx+. Then by Lemma 2, 
2” = (Cx+: Ch+) = (NCx: NC*), 


where C* denotes the image of C(K*) in C(K). This last index divides 


ICx| 
Cpe G*) = 
( K ) ict iy 
which by Lemma 2 divides 
21Cx| i 
= 2h. 
IC(K*)|  * 
This proves the inequality. The statement about m~ = 0 then follows from 


the structure theorem, cf. Lemma 3 of §1, as for p odd. 


§3. The /-primary Part in an Extension of Degree Prime to / 


In this section we prove a lemma of Iwasawa used by Washington [Wa 2]. 
We begin by a trivial remark. 


Lemma 1. Let F be a number field and K a Galois extension of degree d. 
Let | be a prime number not dividing d. Let Cr denote the |-primary part of 
the ideal class group. Then the natural homomorphism Cy; — Cx is injective, 
the norm Nxjp:Cx > Cr is surjective, and the following conditions are 
equivalent: 


(i) Cr = Cx. 
(ii) rank, Cp = rank, Cx. 
(ili) The norm Nxjp: Cx) > Cr) is an isomorphism. 


Proof. For the first assertion, suppose an ideal a of F becomes principal 
in K, say a = (a). Taking the norm yields 


at = (Nx)r®), 


and since d is prime to I, it follows that ais also principal. Abbreviate the norm 
by N. Since 


NCE —_ ci = Cr, 
it follows that the norm is surjective. 
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It is clear that (i) implies (ii). Assume (ii). Then the norm in (iii) is an iso- 
morphism because it is surjective. It is also a G-isomorphism, where G = 
Gal(K/F). Hence G acts trivially on C,(J), and therefore 


N = d- identity on C,(I). 


It follows that C,(/) = C,(l). Nakayama’s lemma or the structure theorem 
for abelian groups concludes the proof that Cp = Cx. 


Lemma 2. Let p be a prime number #1. Let K,, be a cyclic extension of 
a number field Ky, of degree p". Let f be the order of | mod p”. Let C, be 
the l-primary part of the ideal class group in the subfield of degree p’. Let 
D,, be the kernel in the exact sequence 


Norm 


09 D,> CQ —— Gai 0. 
Then D,, # 0 if and only if C,, # C,—1, and in that case, 
dim D, >= f. 


Proof. Let G = Gal(K,/Ko). We have a representation of G on the 
Z(1)-vector space D = D,, and we first show that if D # 0, then the rep- 
resentation is faithful. If not, it is not injective on the unique cyclic subgroup 
of order p, namely Gal(K,/K,_,). Hence it is trivial on that subgroup, and 
therefore 


Nu.n-1D = pD = 0, 


whence D = 0. 
Now assume D # 0. Then the tensor product of D with the algebraic 
closure of F, splits as a G-direct sum 


D@¥Fi=@D,, 


where the D; are irreducible components, of dimension 1. If the operation 
of G on every D; is not faithful, then it is not faithful on the unique subgroup 
of order p, so not faithful on D itself. By what we have shown, it follows that 
G operates faithfully on some D;,. Hence G operates on this D;,, by a rep- 
resentation into the p"-th roots of unity, and a generator of G goes to a 
primitive p"-th root of unity ¢. But D is defined over F,, so the conjugate 
representations by means of the conjugates ¢', ¢”, ... occur. So exactly f 
distinct conjugates occur among the D;,sodim D > f. This proves the lemma. 
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This last part of the argument proves the general statement: 


Let D be a finite dimensional representation of a cyclic group G of order 
p", over the prime field F,. If D is faithful, then 


dim D > f, 
where f is the order of | mod p". 
Washington’s application of the lemma then runs as follows. 


Theorem 3.1. Let K,, be a Z,-extension of Ko, and let C,, be the I-primary 
part of the ideal class group in K,,, where | is a prime # p. If the I-ranks 
of C,, are bounded, then the orders of C,, are also bounded for all n. 


Proof. Otherwise, we must have D, # 0 for arbitrarily large n, and the 
order f,, of 1 mod p” satisfies 


A ie 


The preceding lemma would then imply that the ranks tend to infinity, a 
contradiction, which proves the theorem. 


Remark. The lemma need not only be applied to a Z,-extension. For 
instance, given a number field F, and a prime number |, if K is cyclic of degree 
p over F, then either the /-rank of Cx tends to infinity, or Cp = Cy as p> ©. 
It would be interesting to investigate for what sequence of cyclic extensions 
of degree p does the /-rank remain bounded, or tends to infinity. 


Theorem 3.2. Let K,, be a Z,-extension of Ko. Assume that each K,, 
isa CM field. Let | be a prime number # p, and assume that the I-th roots of 
unity are in Kg. If ord,|C, | is bounded, then ord,|C, | is also bounded. 


Proof. By Theorem 2.1 the /-rank of C7 is bounded, so the /-rank of C, 
is bounded. Then Theorem 3.1 concludes the proof. 


§4. A Relation between Certain Invariants 
in a Cyclic Extension 


The result of this section will be used later to give examples due to Iwasawa, 
of Z,-extensions in which orders of ideal class groups tend rapidly to infinity. 
Results like the first lemma are classical. The prime power case was known 
in the last century, and the general cyclic case is in Takagi and Chevalley’s 
thesis on class field theory. 
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Let K be a cyclic extension of a number field F. Let G = Gal(K/F). For 
each (normalized) absolute value v of F we let e(v) be the ramification index 
of v in K. If v = v, for some prime ideal p of F, then e(v) = e(p) is the usual 
ramification index of p in K. If v is Archimedean, then e(v) = 1 or 2 accord- 
ing as the local extension is trivial or of degree 2 (complex numbers over the 
real numbers). We put 


e(K/F) = [J e(v). 


v 


Then e(K/F) = eo(K/F)e,,(K/F), where 


eo(K/F) = [J e(p) and e(K/F) = [| e(v.). 
p 
We let E denote the group of units and C the group of ideal classes. If G 
acts on a module A, we let A% be the submodule of elements fixed under G. 
The next lemma implies that highly ramified primes have a tendency to 
generate independent ideal classes, and that the obstruction to this is con- 
tained in some cohomology. 


Lemma 4.1. Let K/F be acyclic extension with Galois group G. Then 


h(F)e(K/F) 


[K : F](Ep: Nxjp K* 0 Ep) 


Proof. We assume that the reader is acquainted with a minimum of Galois 
cohomology for cyclic groups. What is needed is covered for instance in 
Chapter IX, §1 of my Algebraic Number Theory, referred to as ANT. 

Let J denote the ideal group and P the principal ideal group. The group 
C§ occurs naturally at the beginning of the cohomology sequence associated 
with the exact sequence 


0O-> Pxrolp > Cx > 0. 


Note that I, is the direct sum of its semilocal components over primes of F, 
and by semilocal theory, we have 


HG, Ix) = @H"G,, D), 


where G, is the decomposition group. But H'(G,, Z) = 0, so 
H1(G, Ix) = 0. 
The exact cohomology sequence then yields 
0- PE + 18 +CE > H\(G, Px) > 0, 
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whence an exact sequence 

0 + I€/PE + CE + H'(G, Px) ~ 0 
which yields the index relation 
(1) CK] = Uk: PX)|H*(Px) |. 


We analyze the two indices on the right-hand side. 
From the inclusions 


Ie a Pi SP; 

we obtain the index 
(1g: Pr) 
(PX: Pr) 
= (Ig: Ip) (Up: Pr) 

(PX: Pr) 
he 
(Pg: Pr) 


(Ig: Px) = 


(2) = €o(K/F) 
We now use the second exact sequence 
0- Ex > K* > Px > 0. 
We get the beginning of the cohomology sequence 
0- Ep > F* > PE > H‘(E,x) > 0 
because H'(K*) = 0 by Hilbert’s Theorem 90. Hence 


(Pk: Pr) = |H(Ex)| 


[K:F] 


= 1E%E®)| = Ga 


(by Corollary 2 of Theorem 1, ANT Chapter IX), so by definition, 


(3) (Pg: Pr) = (Ep: NxjpEx)(K : Fl/e.(K/F). 


Once more, from the second exact sequence, we have another portion of 
the cohomology sequence 


0 = H'(K*) > H'(Px) > H°(Ex) > H°(K*). 
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The map from H°(Ex) to H°(K*) is the natural homomorphism 
Ep/N xp Ex oe F*/Nxjp K*. 
By exactness, we find 


|H*(Px)| = |Ker(Er/NxjpEx > F*/Nxjr K*)| 
(4) = (Nxjp K* O Ep: NxjpEx)- 


Using the inclusions 
Er pa (Nx rp K* Oo Es) D> NxrEx, 


and putting (1), (2), (3), (4) together proves the lemma. 


We shall apply the lemma as does Iwasawa [Iw 15] to get an example ofa 
field with a highly divisible class number. 


Lemma 4.2. Let | be an integer > 2. Let K4 be an extension of a number 
field K of degree d. Let q,,..., 4, be prime ideals of K which split completely 
in K,. Let K' be acyclic extension of K, of degree |, in which q,,..., 4, are 
totally ramified. Let Ki, = K'K,. Then 


I|C(Ka)| 
|C(K4)| 


[i -(K Q)d— 1 


is divisible by 


Proof. We have the diagram 


The extensions K, and K’ are linearly disjoint because of the way any one 
of the primes q; splits in them. Thus Kj has degree | over K,. We apply 
Lemma 4.1 to the cyclic extension Kj over K,. This yields: 


C K’ , 
I(Ra| is divisible by “KK _ 
|C(Kq)| [Kg: Ka] (Eq: Eg) 
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where E, = E(K,) is the group of units of K,. Indeed, E} is contained in the 
norm group from Kj. All we have to do is now estimate the divisibility of 
the expression on the right. 

Each ideal q; splits into d primes in K,, and each factor in K, is then totally 
ramified in Kj. Consequently 


e(K4/K,)_ is divisible by ['4. 


On the other hand, 
[Kij: Kgl = 


Thirdly, E, mod roots of unity has rank bounded by 
[K,:Q] —1=d[K:Q]—-1, 
and consequently 
(E,: E}) divides [#2 


Putting these three estimates together proves the lemma. 


§5. Examples of Iwasawa 


Let K be a number field and let K,, be a Z,-extension. Let q be a prime ideal 
of K. The decomposition group of q in = Gal(K,,/K) is either trivial or 
closed of finite index. In the first case, we say that q splits completely, and in 
the second case, we say that q is finitely decomposed in K ,,. 

Let / be a prime number, which may be equal to p. We are interested in 
giving examples of Iwasawa [Iw 15] for which the /-primary part of the ideal 
class group C, grows exponentially. We shall use the formula of the last 
section, applied to extensions where the ramification indices grow faster 
than the unit index in the denominator, times the degree. This implies that 
the class number on the left-hand side of the relation grows equally rapidly. 

Let K’ be a finite extension of K. Then K,, K’ = K’, is a Z,-extension of 
K’. If q splits completely in K ,,, then any divisor q’ of qin K’ splits completely 
in K‘,. This is an elementary fact of algebraic number theory. In particular, if 
qi,---,q,1in K’ lie above t such primes q,,...,q,, they will be distinct primes 
of K’ splitting completely in K’,. 


Theorem 5.1. Let K,/K be a Z,-extension. Let q1, ..., 4, be prime ideals 
of K which split completely in K,,. Let K' be a cyclic extension of K of 
degree l, in which q,,..., q, are totally ramified. Then 


ord,|C(K,)| = (¢ — [K: Q])p" — 1. 
Proof. This is merely a special case of Lemma 4.2. 
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A concrete example of the situation as in the theorem can be given once 
we have shown below how to construct a Z,-extension in which infinitely 
many primes of K split completely. Then we can take t arbitrarily large. 
We can then first lift the extension over the field K(p,), and thus assume with- 
out loss of generality that K contains the /-th roots of unity. We then select 
an element a of K divisible exactly by the first power of q,,..., q, (and pos- 
sibly other primes). We let 


K' = K(a1"), 


Then the hypotheses of the theorem are satisfied. 


Theorem 5.2. Let K be a CM field. Then: 
(i) There exists a Z,-extension K,. of K, Galois over K*, such that if 
T = Gal(K,,/K), then 
| eae Ga 


(ii) For any such extension, let q* be a prime ideal of K* which does not 
divide p and remains prime in K, so q is its unique extension in K. Then 
q splits completely in K,,. (Tchebotarev guarantees that there exist 
infinitely many such primes.) 


Proof. Let M,(K) be the maximal p-abelian p-ramified extension of K. 
By class field theory, e.g. Chapter 5, §5, there is a quasi-isomorphism 


Gal(M ,(K)/K) ~ U,/E, 


where U, is the product of the local unit groups U, at the primes p above p, 
and E is the closure of the global units in U,. Each real prime p* either 
remains prime in K or splits into two primes p,, p, in K. In either case, the 
semilocal component 


plp* 


of U, contains a subgroup of finite index isomorphic under the exponential 
map with 


for m sufficiently large. Here 0, denotes the local ring of integers at p. From 
this it is clear that U, has Z,-rank > 1. Furthermore, E contains a subgroup 
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of finite index which is real, so E contains a subgroup of finite index which 
is fixed under complex conjugation. Put 


G = Gal(M ,(K)/K), 
and for simplicity of language, assume that p is odd. Then 
G=G'XG, 


and the Z,-rank of 9 is > 1. Hence there exists a factor group I’ of Y such 
that ! = I~, and I is isomorphic to Z, as compact group. By Galois theory, 
l is the Galois group of a Z,-extension K ,, of K, which is normal over K*. 

Let q* be a prime ideal of K* which remains prime in K. Let D be the 
decomposition group of q* in the group 


G = Gal(K,,/K*). 


Then D is (topologically) cyclic because q is unramified in K,,. But & is 
“dihedral,” in other words, ¥ is generated by complex conjugation and [" 
satisfying the relations 
pyp = 7", 

for yéT. Since q* remains prime in K, we cannot have D <I. Then D 
contains an element py, and any such element has order 2. It is then im- 
mediately verified that D is cyclic of order 2 and that its intersection with T 
is 1. This proves that q splits completely in K,,, and concludes the proof. 


Remark. If p = 2, then one has to take a factor group of Y by Y'*? 
to obtain the minus part, and the argument is essentially the same. 


§6. A Lemma of Kummer 


Theorem 6.1. Let p be an odd prime. Let u be a unit in Q(u,). Suppose there 
exists an integer ae Z such that u = a(mod p). If p does not divide the class 
number h,, then u is a p-th power in Q(u,). 


Proof. Let K = Q(p,). By class field theory, it suffices to show that the 
extension K(u'/?) is unramified, because the hypothesis then implies that 
K(u!/?) = K, so u is a p-th power in K. Raising u to the (p — 1)-th power 
allows us to assume without loss of generality that u = 1 (mod p). 

We contend that 


u = | (mod zp), 
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where x = € — 1 and € is a primitive p-th root of unity. Otherwise, we have 
u = 1+ px mod zp 
with x equal to a p-unit in Z¥. But u is a global unit, so 
Nxj(u) = £1 = (1 + px)?’ = 1 — px (mod ap). 


In both cases where the norm is 1 or — 1 we get a contradiction. 
Let « = u’/? be any p-th root of u. Then 


= xX —1)/ 
eas is a root of ene 


nm? 
and this polynomial has p-integral coefficients. Its other roots are 


1 — Ca 


Tt FJ 
and the different is a product of terms 


(Ci — Oa 


’ 
™ 


each of which is a p-unit. Hence K(a) = K(u'/?) is unramified at p, and is 
trivially unramified at all other primes. This concludes the proof. 
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p-adic Preliminaries 


The first section introduces the p-adic gamma function of Morita. After 
that, we deal with a topic which can be viewed temporarily as independent, 
the Artin-Hasse power series, and the Dwork power series closely related 
to it. The latter allows us to obtain an analytic representation of p-th roots 
of unity, which reappear later in the context of gauss sums, occurring as 
eigenvalues of p-adic completely continuous operators. Cf. Dwork’s papers 
in the bibliography. 


§1. The p-adic Gamma Function 
In this section, we give Morita’s definition of the p-adic gamma function. 
To start, we let f be the function defined for positive integers n by the formula 
f(n) =(-1)" [Ti 
j=l 
i j=l 


We wish to show that there exists a continuous function on Z, which restricts 
to f on the positive integers Z*. For this purpose, it suffices to prove the 
following lemma. 


Lemma 1.1. For any positive integers N, n, k we have 
f(n + p™k) = f(n) (mod p*). 
Proof. Let G = Z(p™)*. Pairing an element and its inverse in G we find: 


We —1 mod p% if pis odd 
AYA mod p¥ if p = 2. 


314 


§1. The p-adic Gamma Function 


If p is odd, then 


f(n pa pNk) = (= 1 * I] j 


f(@) n+1<j<n+pNk 
Gi, pP)=1 


= (-0( Thi) = 1 mod p%, 


jeG 
as desired. The proof is similar for p = 2 and is left to the reader. 


By the lemma, we can extend f by continuity to all of Z,, and since f(n) isa 
p-adic unit for n equal to a positive integer, it follows that 


f:Z, > Zs 


is a map of Z, into Z>. 
We define the p-adic gamma function 


Pix) = —f(x« — 1). 


Thus on integers n > 2 we have 


Fy) = (19 TI 


We shall write F instead of I’, when p is fixed, and in particular, for the rest 
of this section. 

Let us now calculate a few values, especially of negative integers. If u(x), 
v(x) are continuous functions of a p-adic variable x, we define: 


u(x) ifxe ZF 


v(x) u(x) ifxe pZ,. 


With this notation, we have 


es is the function such that x -{ 


—nIT(n) 


Tn + eee 


ifn is an integer > 2, and consequently by continuity, 


—xI(x) 


_T(x) for all x € Z,. 


ro+n={ 


We call this the functional equation. 
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From (2) = T(1 + 1) = —T(), we get 
Td) = -1. 
From (1) = Tdi + 0) = —T(0), we get 
T(0) = 1. 
Theorem 1.2. For any integer n > 1, we have 
Tard =a) S(=1y or 
where the bracket, as usual, is the greatest integer function. 


Proof. The theorem is true for n = 1 by the above. We can then proceed 
inductively, using [(1 — n) = {"., [(—n), to get: 


T(0) = ( Tl i)" —n) 


j=l 
Gi, py=l 


where 6(n — 1) = number ofelements among 1,...,n — 1 which are divisible 
by p. This is immediate from the functional equation. Since d(n — 1) = 
[(n — 1)/p], the right-hand side is equal to 


(=D rd a). 

The formula of the theorem then follows at once. 

Let x € Z,. We denote for p odd: 

R(x) = representative of x mod p in the set {1,..., p}. 

If p must be in the notation, we would write R,(x). 

Theorem 1.3. If p # 2 then 

T(x)PQ — x) = (— LF. 
If p = 2, then 
I(x) — x) = e(x), 
where 


—1 ifx =0,1mod4 
a) = , if x = 2,3 mod 4. 
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Proof. By continuity, it suffices to prove the theorem when x is an integer 
n> 1. Write 
n=d) + ap t+ app? +-:-+4a,p", 


with ay € {1,..., p} and a; € {0,..., p — 1} fori > 1. Then 


=] r-1 
=a, te: +ap-. 
Pp 


—1 
n+ |? = |= 20-4 ext +p) + 0x00 + pt) Ho Hae + PD 


Hence for p odd, 


m+ [P=] a9 = Ren mod 2. 


This proves the theorem in the present case. 

If p = 2, then the parity of n + [(n — 1)/2] does not change for the 
elements in the residue class of n mod 4. We then determine explicitly 
the values of this number for n = 1, 2, 3, 4 to get the desired answer. 

If p = 2, then it is convenient to define 

R,(x) = R(x) = representative of x mod 2 in {0, 1}. 


In both cases, p odd or even, we define 


R(x) = RO). 


If p ¥ 2 note that for a positive integer n, 


Rn) = =| 
p 


This follows from the p-adic expansion in the proof of Theorem 1.3. 


Theorem 1.4 (Distribution relation). Let N be an integer >2 and prime to 
p. Then 


Nol (x +i NEA of 4 = 
Tr N )= reo Hr (Z)ave0 : 
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where gy(x) is given by the formulas: 


gn(x) = NRO) 1 Aye R(x) if p #2 
gu(x) = NRO if p = 2. 

Remark. Both R(x) and R(x) are continuous functions of x € Z,. Since 
R(x) is a positive integer, the exponentiation N®™ is well defined. Since 
N?~! = 1 (mod p), its exponentiation with R’(x) is also well defined. When 
x = nis a positive integer, then we can simplify the formula for gy(n), using 

R(n) + pR'(n) = n. 


For instance, for p odd, gy(n) = N"~17®'™, 


Proof. Define gx(x) by using the relation to be proved, so 


gn(x) = TIt( x) mre tt) "Teo. 


By continuity, it suffices to prove the theorem when x is a positive integer. 
We have 


gn(0) = 1 


(i) 
gx +1) Tix +1) N 


gx) — T(x) x 
r— + i 


From the functional equation, we get 


—xX 


aux + 1) = = ty ont) = 4 anon 
=i 


Hence for every positive integer n we find 


gn(n) a Nt 17-é@-1) = Nan iol Dipl, 


If p is odd, then [(n — 1)/p] = R'(n) and the assertion follows. The same 
argument also works for p = 2 and is left to the reader. 


The next result shows that the fudge product occurring in the distribution 
relation is a fourth root of unity. 
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Proposition 1.5. 


Tir(i) = {2 if N is odd 
N +1,+,./-1 if N is even. 


Proof. Suppose N is odd. We write 


Tr(a)="H(s)"('- 5) 


and each factor on the right is +1 by Theorem 1.3. If N is even, then 
TG) ="G),.fL,.P Cn)" a) 
ri—) =T(-= Tj—]r{i --—)}. 
i (i 2 ee N N 


T@)r@) = +1, 


Furthermore, 


and the proposition follows. 


§2. The Artin-Hasse Power Series 


Let p be a prime number. Define the Artin—Hasse power series by 


AH(X) = exp ‘ aa 


n=0 


As usual, exp is the standard power series for the exponential function. 
Then AH(X) has rational coefficients. 


Theorem 2.1. We have AH(X) € Z,[[X]]. 


Lemma (Dieudonné-Dwork). Let f(X) € 1 + XQ,[[X]]. Then 


f(x)? 


(Mel + XZ = Fm 


e1+ pXZ,[[X]]. 


Proof. The left-hand side obviously implies the right-hand side in the 
equivalence to be proved. So assume the right-hand side. If R is any ring, 
and 


Sf(xX)e1 + XR(LX]], 
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then a simple recursion shows that f(X) has an infinite product expression 
f(X) = []G - 4,X") witha, e R. 
n=1 


Furthermore, the elements a, are uniquely determined. 
Assume that f(X)?/f(X")€ 1+ pX Z,[[X]]. Suppose that some coeffi- 
cient a, is not p-integral. Without loss of generality, we may assume that 


f(X) = [] d -— a,X") = 1-a4,X" + higher terms 


n=r 


and a, ¢ Z,. Then 


f(XP _ 1 pa, X' +++ 


= = 1 — pa,X" + higher terms. 
(OO) Ti ax es aa 3 


By assumption, we must have pa, € pZ, so a, € Z,, which proves the lemma. 


We apply the test of the lemma to the Artin—Hasse series. We thus find 


AH(X)? _ exp(p )) X?"/p") 
AH(X?) — exp() X?""'/p") 


Dice 
= exp(pX) = LX" 


To apply the lemma it suffices that ord p"/n! > 1 for all n, which is the case. 
This concludes the proof of the theorem. 


Let us write 
xP = 
exp[x +—]}= Yoex*. 
p k=0 
We wish to give estimates for the coefficients c,. Let 


ord = ord,. 


We shall prove: 


(1) ord Cy > za (2 of 4}. 
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Proof. We can write 


exp(s + *) = AH(x) [] exp( - 1 


n22 

Write 

p” co) ich oo xpnm 

pry os! n = 1 m 
exp(- =) = Sat = Fy 
We prove: 
1 
(2) ord d” > — a ( + 4). 
p" pel 


Indeed, for any positive integer m, we recall that 


m — s(m) 


= 
ord m! a 


where s(m) = s,(m) is the sum of the coefficients in the standard p-adic 
expansion of m. Hence 


ord d®),, = inp Sin ee, 
p-1 p-l 


Factoring out k = mp” immediately implies (2). 


Since 
Cele eee 
n+——]=-—> —— }, 
n22 PB" p-1}) p? p-1 


it follows that 


k 1 
3 ord d(” =o (2+ 4}. 
(3) pad 


Hence the coefficients in the power series expansion of the product 
co) p™ P 
AH(x) [] exp( - =) = exp(x + *) 
n=2 Pp Pp 
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satisfy an estimate like (3), and therefore the coefficients c, satisfy 


This proves the desired estimate (1). 


For each element 2 such that x?~1 = —p, we define Dwork’s power 
series 


E,(X) = exp(1X — 1X°). 


This can also be written 
E,(X) = exp( x + (oy ox) - Seg" 


The coefficients e, lie in Q,(z). In particular 


ord e, € : Z. 
p-1 


As usual, ord = ord,. If we want ord, then we shall specify z in the notation. 
Of course, 


ord, = (p — l)ord,. 


Lemma 2.2. 


(i) We have ord e, > n(p — 1)/p?, and e, is p-integral. 
(ii) If n > 2 then ord, e, > 2. 


Proof. By (1), letting X +> 7X, we find at once that 


=1 
ord e, > n(? 5 } 
P 


To prove (ii), i.e. to prove that 


2 
ord e, > ——— _ forn > 2, 
p-l 
it suffices to show that 


1 
ord e, > —— 
p 
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because ord e, is a fraction whose denominator is p — 1. The inequality for 
ord e, follows if 


p- _ 1 ( Pp ) 
n >——, orequivalently, n> {———]. 
( p? po ? , pel 


For p > 5 orn > 5 there is no problem. For p = 3 and n > 3, there is also 
no problem. For p = 3 and n = 2, we get directly 


exp(1X — 2X”) = 1+ 2X + 4n?X? + higher terms, 


so we get the right lower bound for ord e,. For p = 2, we just compute 
explicitly the coefficients of X?, X3, X* and find that they are divisible by 4, 
thus settling the final cases, and proving the lemma. 


Remark. From the first part of the lemma, we know that the coefficients 
of E,(X) tend to 0. In particular, we can evaluate E,(X) by substituting any 
p-adic number for X in the power series if this p-adic number has absolute 
value < 1. However this value cannot usually be found by substituting the 
number directly in the expression exp(xX — 2X"). We shall see an example 
of this in the next section. 


§3. Analytic Representation of Roots of Unity 


Let p be a prime number. Throughout this section, we let 7 € C, be an element 
such that 


Lemma 3.1. For each element n€ C, such that n’~' = —p, there exists 
a unique p-th root of unity ¢, such that 


¢, = 1+ 2(mod zr’). 


The correspondence m++€, establishes a bijection between p-th roots 
of unity 4 1, and elements 7 as above. 


Proof. If ¢ is a non-trivial p-th root of unity, then (¢ — 1)?" 1 ~ p (where 
x ~ y means that x/y is a p-unit). If €,, €, are two primitive p-th roots of 
unity which are both =1 + x (mod 7”), then £, — C, = 0 (mod x7), whence 
¢, = ¢,. Since there are exactly p — 1 non-trivial p-th roots of unity, and 
p — 1 possible elements z in Q,(z) = Q,(¢), and since any element ¢ = 
1 (mod z) but € ¥ 1 (mod x?) can be written in the form 


& = 1 + yn(mod 7”) 
for some (p — 1)th root of unity y, both existence and the bijection follow. 
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Remark. Each root of unity ¢ gives rise to a character 
W:Z,> py, 


such that w(1) = ¢. The unique character whose value at 1 is ¢, will be de- 
noted by w,, so by definition we have the formula 


Wel) = ¢,, 
and w,(1) is the unique root of unity =1 + 2 mod z?. 


Theorem 3.2 (Dwork). We have E,(1) = ¢,, so E,(1) is the unique p-th 
root of unity =1 + nmodn?. For any ce Z, such that c? = c, we have 


E,(c) = E,(1)°. 


Proof. First we observe that E,(1) is defined by substituting 1 for X in 
the power series for E,(X), which converges in light of the lower bound for 
the orders of the coefficients. Then note that 


E,(X)? = exp(pnX — prX?) = exp(prX) exp(— prX”) 
because the factor p inside the exponent makes the two series on the right- 
hand side converge. Substituting 1 for X can now be done to see at once that 
E,(1)? = 1. To conclude the proof of the first assertion, it suffices to show that 
E,(1) = 1 + 2(mod x’). 
This is clear from Lemma 2.2(ii). Similarly, 
E,(c)? = exp(pn(c — c?)) = exp(0) = 1, 
and 
E,(c) = 1 + cx(mod x’), 
so E,(c) = E,(1), thus proving the theorem. 


Similarly, let ¢€p,., where q = p” is a power of p. Let T denote the 
absolute trace to F,,, and let 


Ung = Wael. 
Then 

Ung? Fa > Bp 
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is a character on the additive group F,, and we let 
Ey, (x) = exp(nx — mx) = E,(x)E,(x") >>: E,(x?”’). 
Theorem 3.3 (Dwork). We have 


Ey (0) = Wr, q(¢ mod p), 
and this is the unique p-th root of unity = 1 + T(¢)n mod 7”. 


Proof. From Lemma 2.2(ii), we know that E,(0)=1+¢x mod r?. 
Hence 


Ex, (6) = E,(Q)E,(C7) nae ECP ‘) 
= 1+ 7T(Q)x mod 2’. 


The same argument as in Theorem 3.2 shows that E,,,(¢) is the unique 
p-th root of unity satisfying the above congruence, thus proving the theorem. 


Appendix: Barsky’s Existence Proof for the p-adic 
Gamma Function 


We include this appendix to illustrate techniques which might be useful in 
similar contexts, say for p-adic differential equations, where an ad hoc 
argument as in §1 cannot be given. No use will be made of this appendix 
elsewhere in the book. 

We wish to show the existence of a continuous function of Z, into itself 
which takes on values related to the factorials at the positive integers. This 
is an interpolation problem, and thus we begin with a criterion for the 
existence of a continuous function taking given values. As in Chapter 4, we 
consider the space of power series 


xn 
g(x) = yD; nv be C,; 


with lim b, = 0 (the limit is of course the p-adic limit). This space is called 
the Leopoldt space. It is in fact a Banach algebra, under the sup norm of the 
coefficients, ||g||_~ = max|b,|.Ifg, h are in the space, so is the product gk and 


Ighllg < llgiielhlly. 
These properties are trivially verified. Note that a power series 


Y bx" 
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with coefficients bj, which are p-integral is in the Leopoldt space, as one sees 
by writing b, = bjn!. 
Theorem. Let {a,} be a sequence in C,. There exists a continuous function 
fiLy aC, 


such that f(n) =a, for all integers n => 0 if and only if the following 

condition is satisfied. Let 

ey a = Sb 
=0 n: =0 


= 


Then lim b, = 0 (p-adically, of course). 

Proof. We use Mahler’s theorem (Chapter 4, Theorem 1.3). Any con- 
tinuous function f has an expansion 

f@= 5, (*) with b, > 0, 
n=0 
and 
b, = (A"f)(0). 

Conversely, if f:Z* > C, is a function such that A"f(0) — 0 as n> , 


then f can be extended to a continuous function on Z,. (We denote by Z* 
the set of integers > 0.) Note that 


n _[n ; 
Arf) = Y(-1y ("\r0. 
i=0 
The theorem is then obvious in view of the identity 
= x" 2 . x" 
e LIM a dL AsO). 


We apply the theorem to the function f defined by 


fO=1, fMe= (ae I] i. 
<jsn 
G. py=1 
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Except for the power of —1, f(n) is equal to n! from which all the factors 
divisible by p have been deleted. Thus 


{:Z* + Zs 
takes its values in p-adic units. 


This function has a continuous extension to Z,. 


Proof. We have 


x" p- 00 pnt+i 
Li = 2 Bin 
Sy yn ti (Pn +)! ee 
EO 2! 2 p'n! (pn + i)! 
= exp : *) >( x)' 
i=0 


Multiplying by e~*, we obtain 
pes 


1 * 
(—x)’ 


i=0 


eS fo) = exp( —x + ca 


x" 
= Dba 


with some coefficients b,, which we must show tend to 0. The set of power 
series 


n 


x ; 
Lon with c, > 0 


is closed under multiplication. Hence after replacing x by —x, it suffices to 
prove that the coefficients c, of the series 


xP Pe ea 
exo(x + *) oe Ce 
tend to 0. But by (1) of §2, we know that 


ord ci > ord k! — Pe 
Pp pork 


_k=sk) k 1 
pail x (2+5-4} 
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Since 
s(k) < (p — 1)(1 + log, k), 


(where log, = high-school-log-to-the-base-p), we obtain 


, 1 1 1 
od 2 (4 ~ (2+) — 1 — log,k, 


from which it is clear that c,, > 0. This concludes the proof. 
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and Gauss Sums 


The history of Gauss sums as eigenvalues of Frobenius on Fermat or Artin— 
Schreier curves goes back to Davenport-Hasse [Da-H] and Hasse [Ha 3]. 
However, Ron Evans has pointed out to me that Howard Mitchell [Mi] in 
1916 considered Jacobi sums in connection with the number of points of the 
Fermat curve in arbitrary finite fields, and proved the first Davenport -Hasse 
relation between Jacobi sums in a finite field and in a finite extension. 

Dwork introduced for the first time spaces of power series with p-adic 
coefficients tending to zero like a geometric series (he denotes such spaces 
by L(b)). He represents Frobenius on factor spaces L(b)/DL(b), where D is an 
appropriate differential operator, and gets the Gauss sums as eigenvalues in 
this context, using a simple p-adic trace formula. 

Washnitzer-Monsky saw the advantage of taking the union 


U) L@), 


b>0 


to obtain a more functorial theory. They also introduced in addition certain 
affine rings over the p-adic numbers, lifting affine rings of hypersurfaces in 
characteristic p. 

Several years ago, Honda looked at Gauss sums again in connection with 
the Jacobian of the Fermat curve [Ho], and conjectured an expression as limit 
of certain factorials. This was proved by Katz by taking the action of 
Frobenius on rather fancy cohomology (unpublished letter to Honda). More 
recently, Gross-Koblitz recognized this limit as being precisely the value of 
the p-adic gamma function [Gr-Ko]. In a course (1978), Katz showed that 
by using the Washnitzer-Monsky spaces and other techniques of Dwork 
(e.g. his special power series, and estimates of growths of certain coefficients), 
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he could give an elementary proof that the eigenvalue of Frobenius acting 
on the space associated with the curve 


yr — y= x™ 


was equal to the appropriate expression involving the p-adic gamma function. 
To establish the equality of this eigenvalue with the Gauss sum, he still 
referred to the rather extensive theory of Washnitzer-Monsky and their 
“trace formula,” without going through the rather elaborate proofs. 

On the other hand, Dwork in [Bo] showed that he could recover the 
Gross-Koblitz formula by working entirely with his spaces, and by using the 
completely elementary trace formula which he had proved already in his 
first paper on the zeta function of a hypersurface [Dw 1]. This paper, and a 
subsequent one, contained a concrete instance of what Serre [Se 3] recognized 
as p-adic Fredholm theory, giving a self-contained treatment which systema- 
tized Dwork’s proofs in those respects which could be viewed as abstract 
nonsense. 

Thus finally it was possible to give a completely elementary proof of the 
Gross-Koblitz formula. The exposition given here follows Katz in first 
obtaining the eigenvalue of Frobenius in terms of the gamma function. The 
second part, getting the Gauss sums, was worked out in collaboration with 
Dwork, mixing in what seemed the simplest way his spaces and those of 
Washnitzer—Monsky. 

It also turned out that the use of the Artin-Schreier curve was unnecessary 
for the derivation of the formula, so the connection with that curve is post- 
poned to the next chapter. I am much indebted to Katz for a number of 
illuminating comments. For instance, although for the special Artin- 
Schreier curve y’ — y = x% we can work ad hoc, Katz in his thesis [Ka 2] 
worked out completely in the general case the relations between Dwork 
cohomology and Washnitzer-Monsky cohomology. My purpose here was 
to derive the Gross—K oblitz formula essentially as simply as possible, and to 
avoid such general theories. 


§1. The Basic Spaces 


Let p be an odd prime, N a positive integer prime to p, and q = p’ such that 
= 1 mod N. We let 


We let 
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be the Teichmuller character, and we let 
Wr, = Wa oTr 


be the additive character formed with the absolute trace Tr and the character 
w, defined at the end of the last chapter. We let 


(wo, Wr,q) = — S(wj, Wn,q) 


be the negative of the Gauss sum, where 


Sayw = Y xQWeo. 


xeF, 


We let R be a discrete valuation ring of characteristic 0, complete, such 
that the maximal ideal contains the prime p and also contains 2. We let K 
be the quotient field of R. 

We shall work with the ring of power series R[[x]]x whose coefficients 
have bounded denominators. This is also given by 


RUDVx = RED J)L1/p1. 
We want a vector space 
Din & R((x]]x 


such that, if we put 


d 
Hie = jn “|i, 


then dim H,,, = 1 and H,, is a representation space for the Frobenius map 


OF having t(w$, Wz,4) as eigenvalue. We shall construct such a space. 
We shall use an embedding 


emb,: R[[x]] > R[[x]] 
given by 
p(x) + x! exp(—1x™)ep(x%). 
We have a corresponding embedding into differential forms 


emb* : R[[x]] > R[[x]] dx 
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given by 
N ny 4X 
P(x) Hx! exp(— mx" p(x") —- 


(We use here the assumption that j > 1 so that no negative powers of x 
occur in the right-hand side.) 
We have a commutative diagram 


R(E[x]] "> REExI] 
DW 1. vo, | 


RELxT) sage REL«T) ax 
where D, is the differential operator given by 


d : 
oo ee ee 


D 
dx N 


This is immediate from the rule giving the derivative of a product. 


Remark. For any « not equal to an integer <0, one verifies at once that the 
differential operator 


d 
D=x—-— + 
xs TX +o 


is injective on power series, by looking at its effect on the term of lowest 
degree. 


For each positive number 6 we define 

L(6) = K-vector space of power series )\ a,x" in R[[x]]x such that 

ord a, — 6n— ©. 
This condition could also be written ord a, = dn + oo(n). 

Remark. This is a variation of Dwork’s definition, and one may think of 
elements of L(5) as functions holomorphic on the closed disc of “radius” 
p’. As usual, we take 

ord = ord,. 
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We define 


LO+) = U L(6). 


We let 


A ;,, = Image of L(0+) under emb; 
= space of power series of the form x/ exp(—2x")g(x") with 
v(x) e L(O+). 


This is the space used to define H, ,, by the formula above. Thus we have an 


isomorphism 


coming from commutative diagram DW 1, and induced by emb,. 


Lemma 1.1. Let « € Z, and suppose that « is not an integer <0. Let 
d 
D=x ae TX + 
(i) We have a direct sum decomposition 
L(+) = K @ DL(O+). 
(ii) If 6 => 1/(p — 1), then we have a direct sum decomposition 


L(6) = K @ DL(6). 


Proof. We wish first to write an arbitrary series g(x) as some constant plus 
Dg for some g. We first solve this problem for powers of x. We have for integers 
m>0: 


Dx™ = (m + a)x™ _ mnt - 


so that 
1 1 
x™*l = —(m + a)x™ — o(2 x"), 
7 1 
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Recursively, we obtain 


gore = : (m + «)(m — 1 + @)---(a) 


gmt 


1 1 

~o| 2x + (m+ ajx™™) +--- 
Tt nT 

1 ~ 
+ calm + a) (m — i + 1+ a)xm™™h + eee], 
Hence we have proved what we wanted for powers of x, of degree >1. 
Next, we see what happens for a series in L(0+): 
foo} 


= 1 
()Y Pmerx™ =D bye at (mt + a) (@) 


m m=0 


-p 5 =| 3 Dm +1 —(m +a)---(nt+1 +. 


m=n 


This gives a formal solution, and we want to see that the right-hand side 
lies in 


K + DL(O+). 


For this we need a lemma giving estimates for binomial coefficients. 


Lemma 1.2. Let « € Z,, and let n be a positive integer. Then 


ord a(a — 1)---(a—n+ 1) - 2 —1 — log, n. 


Proof. Obvious, from 


mv" n\n 


sen ent E(t) 


from the fact that the binomial coefficient is p-integral, and from 


n — s(n) s(n) 
ae <1 + log,n. 


ped 


ord n! = 


Since the coefficients b,,,, tend to 0 like a geometric progression, it is 
clear that the first series giving the constant term in the formal solution is 
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actually an element of K. As for the second, there is some positive 6 such 
that 

ord b,,4, => 6(m + 1) + co(m). 


Hence by Lemma 1.2, for m > n we get 


1 
(**) Ord bm +1 maa (m + a)-++(n +144) 


Sey ae ae 
p-1 p-1 


+ co(m). 


This proves that the coefficients of x” tend to 0 like a geometric series, and 
hence proves that 


L(+) = K + DL(O+). 


There remains to prove that this sum is direct, in other words that 1 # Dg 
for g € L(0+). This is a special case of the following lemma. 


Lemma 1.3. The equation Dg = 1 has a unique solution 
g(x) = 0b, x" € K[[x]], 


and the coefficients satisfy 


1 
Ee gay eR 1). 


Proof. Write 


Then 
1+ > mb, x"*! = y (nb, + ab,)x", 
so that 


1= abo and 2b,_, =(n+ a)b,. 
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Hence 


me" a" 


b, = = : 
"(n+ a)---(a) n+o 
(n + ("> ) 


The same estimate as before, using Lemma 1.2 shows that 
1 
ord b, < 1+ po + log,(n + 1), 


which proves the lemma. 
From the lemma we see that the formal solution of Dg = 1 cannot lie in 
L(O+). This concludes the proof of Lemma 1.1(i). 


For the second part we disregard the factorials completely (they are p- 
integral), and use the more trivial estimate 


ord b,,4, = 0(m + 1) + co(m). 


Then (**) is now estimated by 


ord By + (m+ a)--(n-+ 1 +a) 2 dm + 1)~ "= + cotm) 


mnt 


> dn + co(n) 


because 6 > 1/(p — 1). This proves the lemma, because the rest of the proof is 
identical with what we did before. 


Theorem 1.4. We have isomorphisms for 6 > 1/(p — 1): 
L(6)/D;L(6) = L(0+)/D,;LO+) © Aj. 55 
and these spaces are 1-dimensional. 


Proof. Immediate from Lemma 1.1. 


§2. The Frobenius Endomorphism 
On the power series ring K[[x]], we have the Frobenius endomorphism 


®,: K([x]] > K(+]] 
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such that x ++ x?, and similarly ®, = ®,. Then for h(x) € K[[x]] we have 
OF (h(x) dx) = h(x") dx? = h(x")qx4 = 
First we determine the mapping corresponding to the Frobenius under the 
embedding emb*. Thus we define 
B;p = XP -INE (x), 
where j’ is the integer satisfying 
1<j <N-1 and j =pjmodN. 
Then we have commutative diagrams 


emb, 


R([x]] —> RIX] 
FR IL. 2. |e 
RECxT] ap, RIT 
and 
RELI] > RE[xI] dx 
pPB,, | | 
RULx]] ~no?? RED] ax. 
The commutativity is done by direct verification, and is immediate, starting 
with a power series p(x) in the upper left-hand corner, and using the defini- 


tions of the various mappings going around one way, and then the other way. 
A direct verification also shows that 


©,:0;,,.7 Dj. 
maps V, , into %, ,, and furthermore 

OF od =do®,. 
Consequently we obtain a homomorphism 


OF :H,,7H 


J,m 
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while the diagram FR 1 yields the corresponding diagram 


LO+)/D; L0+) "4H, 
FR 2. r.,| |» 
LO+)/D; LO+) “nee Hy 5: 


For our purposes, we also want to look at a composite Frobenius endo- 
morphism, so we let 


Bjq = X°Eg, (x)®,q. 
Then we have the commutative diagram 
L(0+)/D,L0+) "2 H,,, 
FR 3. Big | |e 
L(O+)/D;LO+) -aapp Hin 


Thus qB,,, is an endomorphism of L(0+)/D,L(0+), corresponding to the 
endomorphism ®¥ under the embedding emb}. 


Inverse of the Frobenius Endomorphism 


We also want to tabulate formulas about the inverse of the Frobenius 
endomorphism, actually one-sided. We define it on power series by 


Yo: > a,x" > a,x", 


qin 


and on differential forms by 


1 dx 
=x"1— if g|n 
wr Pes = 44 = . 
4 x 
0 otherwise. 


Then 


Again we have the commutation rule (trivially verified) 


DW2. Prod =doW,. 
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Let 
p= Ajg = Vqe (x Eg, g(x) *). 
This means that A; is the composite of multiplication by Sai Oar 9 Oi 


followed by ',. Since 0 < a < q, it follows that A; maps x-integral power 
series into x-integral power series, and we have 


Aj.q ° Big = id. 
We shall need the commutation rule 


This can be proved either directly, or by the same pattern as for the cor- 
responding fact for B;, using DW 2 and FR 3, together with the fact that 
emb, is injective. This gives rise to the commutative diagrams: 


RITA ROT RET] 2 RET) 
FR 5. “| |e and ‘| ke 
ROT aanp? ROT RET aa? RELI 


We now consider the effect of the inverse of the Frobenius endomorphism 
on the Dwork spaces. We see directly from the definitions that 


W,: L(6) > L(q6) 


maps L(6) into L(q6). 
Furthermore, let g € L(6). Then multiplication by g, 


froaf 


maps L(6) into itself. 
Let 6’ satisfy 6’ < 6 < q0’. Let g € L(0’). Then 


¥,°g:L(6) > L(d) 
maps L(6) into itself. Indeed, this map is obtained as a composite map: 
L(6) > L(6) 4 (5) Ls’) “> L(S). 
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We shall apply this to the case when g(x) = E,(x). Since p > 3 we can 
select 


and then select 5’ < (p — 1)/p’*' but close to(p — 1)/p’*! such that 
5 <6 < go. 
Then 
E,,, q(x) € L(6’). 
Lemma 2.1. Under the above conditions on 6 and 6’, 
Aj;,q: L(6) > L(6) 
maps L(6) into itself, and induces a homomorphism 
Aj,_: L(6)/D,L(6) > L(6)/D;L(6). 
Proof. This is a special case of the preceding discussion, except that the 
negative power x“ occurs inside the operator A;. However, since a < q — 1, 
we have already seen that ’, annihilates such negative powers, so A;,, maps 


L(6) into itself. The commutation rule FR 4 shows that A;,, induces a 
homomorphism on the factor space mod D;L(6), as desired. 


By Lemma 1.1 we know that this factor space is one-dimensional. Con- 
sequently, the operator A;,, has an eigenvalue on this space, which we 
denote by 1; , = 4;. By definition, we have the relation 


where h; is a uniquely determined power series, which lies in L(6). Our 
object is to determine A;. We shall show that 4; is a Gauss sum. 
We have a commutative diagram 


L(0+)/D;L(O +) —— L(6)/D;L(6) 
FR 6. al la 
LO+)/D;L(0+) ——> L(6)/D;L(6). 
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The horizontal maps are the inverses of those obtained from the natural 
inclusion L(6) < L(0+), see Theorem 1.4. We know that 

A j oB p= id, 


so we conclude that the eigenvalue of B; is the inverse of the eigenvalue of 
A, 


ie 


§3. The Dwork Trace Formula and Gauss Sums 


Consider first the complete space R€x),, consisting of the power series 
whose coefficients tend to 0. Each such power series converges on the (closed) 
unit disc. We may view the powers of x, namely 
1, x, x?,... 
as forming a “basis” for this space. If 
u: RKx), > RKxY, 

is an endomorphism, let us work formally, and represent u by an infinite 
matrix with respect to this basis. We define the trace to be the sum of the 
diagonal elements. For this to make sense, we must have appropriate 
conditions of convergence, and in §5 we justify the formal computations 
which we shall make here in light of the p-adic Banach Fredholm theory of 


completely continuous operators. 


Theorem 3.1. Let g(x) € R€x),. Let 1 <a<q-— 1. Then 
(q — 1) tr(¥, ° x “g(x)) = 2 6° %4(0) 


where the sum is taken over € € p,_. 
Proof. Write 
h(x) = x~%g(x) = } a,x". 
Then a, — 0 (p-adically). Let 
P(x‘h(x)) = ¥ a;,x/. 
Then a;; is the coefficient of x” in x‘h(x), and so 
ij = Agj-i- 
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Hence a;; = ajq—1);, and so 
(q - 1) Bs ay, = (q — 1) ¥ aq 1: 
But 
VA = Ya,c", 
i ian 


and in this sum, the n-th term is equal to 0 unless n is divisible by g — 1. The 
remaining terms give precisely the desired value stated in the theorem. 
Let 


(0g: F(q)* > wy-1 
be the Teichmuller character, such that 
@,(c) = c mod p. 
The Gauss sum is defined by 


Suv) = Dd xcW(c). 


ce F(q)* 


We apply the preceding discussion with g(x) = E,, (x) '. Then Theorem 3.3 
of Chapter 14 yields: 


Theorem 3.2. Let a = j(q — 1)/Nand,,, = W,° T, where T is the absolute 
trace to the prime field. Then 


(q — 1)tr A, = S,@jz*, Wz). 


Note that the additive character is determined in the usual canonical way 
(composing with the trace) from the character y, on the prime field. Hence 
we shall usually omit the additive character from the notation. 


Remark. In the next theorem, we assume that the formalism of determin- 
ants works in the present situation. This is justified in the last section of the 
chapter. Specifically, the operator A;,, is viewed as an endomorphism of 
L(6). A Banach basis consists of appropriate scalar multiples of the powers 
x", and the argument used in Theorem 3.1 applies to this case to yield 
Theorem 3.2. Cf. §5, Example from Dwork Theory. 

In the next theorem, we follow usual notation, and let 


t= —S, 
be the negative of the Gauss sum. 
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Theorem 3.3. Let 1 <j < N — 1, and a = j(q — 1)/N, where N divides 
q — 1. Let 6 be a rational number such that 


ep eer 
p-1l 


Let W = L(6)/D;L(6). Let 4; be the eigenvalue of A, on W. Then 
My = Tq *, Waa): 
Proof. By FR 4 of §2, we have a commutative diagram with exact rows: 


0 ——> L(6) > L(6) ——> Ww —— 0 


a | ig 


0—— L(6) > L(@6) —- W—— 0. 
By the additivity of the trace (cf. Proposition 5.6 below), we have 
tr A; = tr qA, + tr Aj. 


Since W is 1-dimensional, tr A ; is the eigenvalue of A j- The theorem is now a 
direct consequence of Theorem 3.2. 


§4. Eigenvalues of the Frobenius Endomorphism 
and the p-adic Gamma Function 


By Lemma 1.1(1) we know that 
: d 
w; = x’ exp(—1x") = 


represents a basis of H;,,. Consequently there is some element A(j, 2, N) 
such that 


(1) ®*(w,) = Aj, x, Noy, in Hyg. 


We shall relate this element A to the p-adic gamma function. 
Note that ®f is an endomorphism of H,,,,, So its eigenvalue A, is given by 


(2) dq = AG, m, N)ACI, m N) ++ AGO, 1, ND. 
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Theorem 4.1. We have 
“h-§ 
AC, Tl, N) _ —ei-Fn . 
Proof. (Following Katz.) We have 
o* (~ exp(—7x™) *) = Ax! exp(—1x") + dg 
with some g € R[[x]]x, that is g is a power series with bounded denominators. 
Expanding out, we find 


: re . —7)" awa; AX 
pri )" MPN + pi : = jn.) —™ x N + j a 


Equating coefficients, using npN + pj = mN + j’, we obtain 


(—7)" 


(3) p— = An nN)S— a 


+ O(npN + pj). 


Note that O(npN + pj) = O(n + j/N), and 


HT 


m=npt+ N 


This yields 
: mont! ! 
p = AG, n, N)(—2) n+ ons d) 5 
On the other hand, we have 


(4) T(t + m) = (-1iem 
nip 


This is immediate from the definition of the p-adic gamma function, and the 
fact that a positive integer i < m divisible by p can be written in the form 


pi-J 
pN ’ 


i= pio with ip S<n+ 


and therefore ip < n. Hence the denominator n!p” is exactly the product of 
such i divisible by p. 
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Substituting (4) in (3), using the fact that 7?~1 = —p, and therefore 
n'P— 1) = (—p)", we find 


AG,N, 2), j\n! 
5 —pa= ee FpPI- IN 4 Ol n + =) —. 
() ae ee ONT 


The following lemma then concludes the proof. 


Lemma. There exists a sequence of positive integers n such that 


(i) n — —j/N p-adically, and hence m > —j'/N; 
(ii) ord(n + j/N) + ord(n!/n") > 00. 


Proof. First we simplify slightly the exponent in (ii). We have: 


‘ ; : = 
ord + i) + ord = = ora + #) ee AO, 
N Tm 


N p-1l p-1 
_ j\_ _s@) 
= ord(n + 2) ae 


Let q = p’ be such that N divides g — 1. Write the expansion 


2 LIGHT) AG) 
N  NU—q) N 


(l+q+q?+-->). 
There is also a finite expansion 


J(q — 1) 


N =d)t+aypt+-:::+a,_;p" 


with standard integral representatives 0 < a; < p — 1. Let 


d= 4) 


1 bees M1). 
N Cages egy 


For M tending to infinity, we see that n approaches —j/N to satisfy (i). 
Furthermore 


ora(n + i) > M-ordq = Mr. 


On the other hand, since j(q — 1)/N < q — 1, we have 
s(n) < M((p — 1)r — 1). 
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The second condition of the lemma is then obviously satisfied also. This 
concludes the proof of the lemma, and of Theorem 4.1. 


Theorem 4.2. Let A, be the eigenvalue of Frobenius ®f on H,,,. Then 
Aq = Tq(@4; Wr, q)- 


Proof. This follows from Theorem 3.3, the remark at the end of §2, FR 3, 
and the fact that 


ux, Wx" Wo") = @. 
Let a be a positive integer, with 1 < a < q — landgq = Pp’. Write 
a= dy) + a,p +---a,_yp" 
with integers a; such that 0 < a; < p — 1. As usual, define 


s(a) = Di a;. 


Theorem 4.3 (Gross—Koblitz formula). 
‘i _ r-1 iq 
rb Vad) = (Var Tr (1 — eh) 
i=0 q-1 


Proof. We merely put together Theorem 4.2, and the expression for the 
eigenvalue obtained in Theorem 4.1. The only quantity remaining to be 
worked out is the power of x appearing on the right-hand side, and this 
follows from the following lemma. 


Lemma 4.4. Let a = j(q — 1)/N. Then 


(i) (i+ 1) 
lest CeO O8 eaea! ee 
Day = s( x )=s 


imodr 


Proof. This is essentially the easy Lemma 1 of Chapter 1, §2. Indeed, 


Ube gq) 


N q-1 


where a“ is defined by 


; @ ig 
t<a®<q-—1 and ee: ). 
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With this change of variables from j to the corresponding a, the present lemma 
is identical with Lemma 1, loc. cit. 


Let $8 be the prime in the algebraic closure of Q,. In the next theorem we 


show how Stickelberger’s result on the Gauss sum mod $B follows from the 
Gross-Koblitz formula. 


Theorem 4.5 (Stickelberger). Let y(a) =| Ja;!. Then 


mor", Wag) = 


“ies mod §. 


Proof. From the formula tt = q, and Theorem 4.3 we obtain 


. og’ wea) _ (=I 
569) G(a) 4 


where 


CMe) eg) 


Replacing z by —7 in the left hand side of (*) yields 


1 OP (ay a We. aX es 1), 


The theorem then follows from the next lemma. 


Lemma 4.6. 


r,(1 - (2 *)) = (—1)'*%a;! mod p. 


Proof. Note that 


V1 + a) = (—1)'*%a;! 


This is true by definition if a; > 2 because a; < p — 1. It is also true when 


a; = 1 and a; = 0 by the direct computations of Chapter 14, §1. On the other 
hand, by Lemma 1.1 of Chapter 14, we know that 


rji+x)=7T,(1+y) if x =y mod p. 
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Thus it suffices to prove that 


- (Pe) = a;mod p. 


As usual, we write cyclically 


a=dyt+apt+---+a4,-yp"' 


pa =a,_, + ap +---+a,-2p"' 


mod q — 1 


The desired congruence follows at once, thus proving the lemma and the 
theorem. 


§5. p-adic Banach Spaces 


The purpose of this section is to do as rapidly as possible the linear algebra 
in p-adic Banach spaces justifying the use of determinants and traces in this 
chapter. Thus we cover only part of Serre’s paper [Se 3], whose exposition 
leaves nothing to be desired. 

Let R be acomplete discrete valuation ring, with quotient field K, maximal 
ideal m, residue class field k, and prime element z. 

By a Banach space E over K, we mean a normed space which is complete, 
and such that the norm x +> |x| satisfies 


|ex| = |cllx| and |x + y| < max{|x|, |y]} 


for ce K and x, ye E. We also assume that the value group |x| (for x € E, 
x #% 0) is the same group of positive real numbers as the value group of K*. 
So for each x € E, there exists ce K such that |x| = |c|. 

Let E, F be Banach spaces over K. Let L(E, F) be the vector space of 
continuous linear maps 


u:E > F. 


As usual, we can define a norm on L(E, F) by 


|ux | 
ju| = sup —. 
x#0 |x| 
Also as usual, we have 
ju] = sup |ux|. 
|x| <1 
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By an isomorphism, we mean a norm-preserving continuous linear map 
having an inverse. 


Example. Let / be a set of indices, and let C(/, K) be the set of families 
x = (x)j;<1, X; € K, such that x; tends to 0 as i > oo. By this we mean that 
given « there exists a finite set of indices S such that for i ¢ S, we have |x;| < e. 
The reader may as well think of the positive integers where i = 1, 2, 3,.... 
We define 


|x| = sup|x;|. 


Then C(/, K) is a Banach space. 

More generally, if F is a Banach space, we may form families (x,) with 
x; € F such that x; > 0. Such families with the sup norm form a Banach 
space C(/, F). 


A family {e,;} in a Banach space E will be called a Banach basis if every 
element x € E can be written uniquely as a series 


x= ) xe, with x; +0 


iel 


and |x| = sup|x,|. It is clear that the space CU, K) has such a basis, with 
e(i) = | and ei’) = Oif i’ # i. 

Let E be a Banach space and let Ey denote the subset of elements x € E 
such that |x| < 1. If {e;} is a Banach basis, then E, consists of the set of all 
elements 


> Xie; 
such that |x,;| < 1 for all i. 


Lemma 5.1. Let m be the maximal ideal of R. Let E = Ey/mEg. A family 
{e;} in E is a Banach basis if and only if all e; € Eg, and their images 2; in E 
form an algebraic basis of E as vector space over k. 


Proof. Suppose that {e;} is a Banach basis for E. Then first it is clear that 
every element of E can be written as a linear combination of the @, with 
coefficients in k, and such a combination is unique, as one sees by lifting 
back to Eo. 

Conversely, suppose {é,} is an algebraic basis for E. Any element x € Ey 
can be written 


x =) xfPe, + ax with x € Ep. 


349 


15. The Gamma Function and Gauss Sums 


Iterating, and expressing x“ in a similar way, we get an expression 
x=) x,e, with x;¢R and x; 0. 
Such an expression is unique, for if we have a relation 
>Y x:e, = 0, not all x; = 0, 
then first we may divide by the highest power of x dividing all x;, so that not 
all x; are divisible by z, and then we may reduce mod m to get a contradiction. 
Finally if |x| = 1, we have 


|x| = sup|x,l, 


and the same relation holds for any x by multiplying with an appropriate 
power of z. This proves the lemma. 


Proposition 5.2. Every Banach space over K is isomorphic with a space 
CU, K). Equivalently, every Banach space has a Banach basis. 


Proof. Immediate from the lemma, by lifting an algebraic basis from E 
back to Eo. 


Corollary. Every closed subspace F of a Banach space E has a comple- 
mentary closed subspace F’ such that E x F x F’. 


Proof. Let F be a closed subspace of E. By the theorem, there exists a 
Banach basis {e;} of the factor space E/F. Let e; be a representative of e; in E 
such that |e;| < 1. The map 

ule; e; 
extends to a continuous linear map E/F > E, of norm < 1. Then the map 
E/Fx F>E 
given by 
(X, y)k> uX + y 
is an isomorphism, as is immediately verified. This proves the corollary. 
Let F be a Banach space. Let J be a set of indices, and let B(/, F) be the set 


of bounded maps of J into F, i.e., the set of bounded families (f;);-, with the 
sup norm. Then B(/, F) is a Banach space. 
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Proposition 5.3. Let E = C(I, K) and let {e;} be a Banach basis. T hen we have 
an isomorphism 


L(E, F) > BU, F) 
given by 
ub> (ue;)ier- 


Proof. Let @ be the map from L(E, F) to B(/, F) as given in the statement 
of the proposition. Conversely, if (f,) is a bounded family in F, define a map 


w: BU, F) > L(E, F) 
by associating to (f;) the map u such that 
u() xe) = 5; Xi fi. 


It is then immediate that ¢, y are continuous linear, inverse to each other, and 
have norms <1, so that they are isomorphisms. This proves the proposition. 


The proposition will be used especially when F has a Banach basis, so 
that F = C(J, K) for some set of indices J. Each element ue; can then be 
written uniquely 


 g 
ue; = y Cie; 
i 


where (e’) is the natural Banach basis for C(J, K). Thus u has an associated 
matrix (c;;), relative to the bases (e;) and (e’). By the definitions, and Proposi- 
tion 5.3, we have 


|u| = sup Icijl. 
i,j 


Example. Let F = K so that L(E, K) is the dual space, denoted by E*. 
If we let ue; = c;, then 


|u| = sup|c;|. 
i 


Let u € L(E, F). We say that u is completely continuous if u is a limit of 
elements of L(E, F) with finite dimensional image. (The limit of course taken 
with respect to the norm on L(E, F).) We denote by CC(E, F) the space of 
such completely continuous linear maps. It is obviously a Banach subspace 
of L(E, F). 
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Let E, E’, E” be Banach spaces, and let 
E ale E' ey E" 
be continuous linear maps. If u or v is completely continuous, then so is v ° u. 
This is immediate from the definition. In particular, CC(E, E) is a two-sided 
ideal in L(E, E). 


Suppose now that F = C(J, K) with Banach basis (f;). Let ue L(E, F). 
For each x € E we can write 


ux = Dufx)f; 
where u, € E* is an element of the dual of E, i.e. a functional. We define 
ru) = |u;l. 
Then 


|u| = sup r(u) = sup |u;l, 
Jj J 


directly from the definitions. 
Let E = C(I, K), and let (c;;) be the matrix associated with u, relative to 
the bases (e,) and (f;). Then 


|u;| = sup|cijl- 
This is clear from the example following Proposition 5.3. 
Proposition 5.4. Let F = C(J, K), with Banach basis (f;). The map 
ub (uj) 
gives an isomorphism 


CC(E, F) = CU, E*) 


between CC(E, F) and the space of families (u;) of elements in E* such that 
u;— 0. In particular, an element ue L(E, F) is completely continuous if 
and only if u; — 0. 


Proof. First suppose u has finite dimensional image, so without loss of 
generality, we may assume the image is one-dimensional. Then u(x) = v(x) f 
for some f € F, and v is a functional. Then u; = f;v, where /; is the j-th 
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coordinate of f, so clearly u; > 0. Next let u” € L(E, F) approach u, and have 
finite dimensional images. Then 


n) 
uy? > u; 


uniformly, so u; > 0 also in this general case. 
Finally, let (v;) be a family of elements in the dual E* such that v; > 0. 
Define u by 


ux = ) vx) fj. 


Then u is certainly continuous linear. The partial sums us defined for every 
finite set S of indices by 


us(x) = )vfx) fj 


jes 


approach u in the norm of L(E, F), and have finite rank. Hence the cor- 
respondence between CC(E, F) and C(J, E*) is bijective. It is immediately 
verified to be norm preserving. This concludes the proof. 


Example from Dwork theory. Let E = L(6), with the norm defined as 
follows. If 


f(x) = Yia,x" 


is in L(6), we define 


I fils = sup 


apn 
nd 
P 


Then E is a Banach space. We assume that 6 is rational, and that the constant 
field K has been extended by a finite extension if necessary so that p® lies in K. 
The powers 


(p*x)", n=0,1,2,... 
form a Banach basis for E. If 6’ < 6, then the inclusion 
L(6) > L(6’) 
is completely continuous. This is obvious by looking at the associated matrix, 
and using Proposition 5.4. 
On the other hand, if g € L(6'), then the map f+ fg is a continuous 


linear map of L(6’) into itself. Furthermore, the Dwork map YY, is a con- 
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tinuous linear map of L(6’) into L(q6’). It follows that the Dwork operator 
A,,, is completely continuous on the space L(6). 

We now come to the results on Fredholm determinants which were used 
in the context of Dwork theory. First we make some remarks about de- 
terminants in finite dimensional spaces. 

Let L be a free module over a commutative ring, and let u be an endo- 
morphism of L such that u(L) is contained in a finitely generated submodule. 
Let M be a submodule of L which is finitely generated, free, and contains 
u(L). Let 


uy:M—>M 
be the restriction of u to M. The polynomial 
detU ro tuy) 


is well defined, and it is easy to verify that it does not depend on the choice of 
M. We may therefore denote it by 


det — tu) =1Ltet te +egt™+--°. 

The coefficients c,, can be expressed in terms of u as follows. 
Let (e;) be a basis of L. Let S be a finite subset of the set of indices I, with 
card S = m. Let A = (a;,) be the matrix of u with respect to this basis, and let 


dets(A) = determinant of the submatrix of a;; with i,j € S. 


Then 


Cm = (- 1)” s det;(A), 


|Sj=m 


where the sum is taken over subsets S having m elements. 

This formula actually relates to a finite square matrix. It suffices to prove 
it when the matrix consists of algebraically independent elements in character- 
istic 0 (by specializing afterward). In that case, the matrix has distinct 
eigenvalues, and the formula is obvious if the matrix is diagonal. To get the 
formula in general, all we need is an invariant characterization of the terms 
in the sum over S. This is provided by the second expression for the coefficients: 


Cm = (— 1)" tr "a, 
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where /\"u is the m-th exterior power of u, acting on /\"M, and M may be 
assumed to be a finite dimensional vector space. But in that case, one finds 
at once that 


tr /\"u= > dets(A), 


|S|=m 
by looking at the trace with respect to the basis 
{e;,, A+++ Ae, }. 


This proves that the coefficients c,, have the value as stated. In particular, 
c, = —tru (sum of the diagonal elements), so 


det — tu) =1-—(trujt+-::. 
Now suppose E is a Banach space with Banach basis (e,), i € I. We wish 
to define det(I — tu) for any u € CC(E, E). Suppose first that |u| < 1. Let 
E, be as before, the subspace of elements x in E such that |x| < 1. Then 


u(Eo) © Eo. For each positive integer n, the endomorphism u defines an 
endomorphism 


u("): Eo/t"E9 > Eo/n"Eg. 


We denote Ey/x"Ey = Eo(x"). If (a;;) is the matrix of u with respect to the basis 
(e;), then we know that 


|u;| = sup |qj,| 0. 


Hence there exists a finite subset S of indices j such that for all i and all 
j € S, the components a;; lie in the ideal (x"). Hence the polynomial 


det(I — tu(x")) 


is defined, as a polynomial in R/z"R. As n tends to infinity, these polynomials 
form a projective system, and we define 


det(I — tu) 


to be their limit. It is a formal power series, in R[[t]]. 

Ifuis arbitrary in CC(E, E), we can find an element c € R,c # 0, such that 
|cu| < 1. Then det(I — tcu) is defined, as a power series D(t), and we define 
det(I — tu) = D(tc~ 1). This is independent of the choice of c. 
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Proposition 5.5. Let u be a completely continuous endomorphism of the 
Banach space E. Let (e;);- 1 be a Banach basis, and let A = (a;;) be the matrix 
of u with respect to this basis. Let 


det(I — tu)= ¥ ct”. 
m=0 


Then: 


(i) Cm = (—1)” x dets(A). 


where S ranges over subsets of I with m elements. 

(ii) Given a positive number r, we have |c,,| <r” for all m sufficiently 
large. 

(iii) If {u,} is a sequence in CC(E, E) and u, — u, then 


det(I — tu,) > det(I — tu), 


where the convergence is the simple convergence of coefficients. 
(iv) If u has finite rank, then det(I — tu) is the polynomial defined from 
elementary linear algebra. 


Proof. Suppose first that |u| < 1. Then the formula for c,, is valid for each 
u(x"), and so remains valid in the limit. The general case follows by using 
c # 0 such that |cu| < 1. 

For (ii), let S be a finite subset of m elements. Each product in the expansion 
for the determinant det,(A) contains m elements, indexed by m distinct 
indices j,,...,j,- Let 

rj; =|u;| so r;>0. 


Then any product of m elements as above is bounded in absolute value by the 
product r, ---r,,. Hence 


|dets(A)| <r,---r,, for each S, 
so 


(Cpl Sle Nee 


Since r; > 0 it follows that r; < r for all j sufficiently large, and (ii) follows. 
The third assertion follows trivially from (i). 
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Finally, suppose u has finite dimensional image. After replacing u by a 
scalar multiple, we may assume that |u| < 1. Let F be a finite dimensional 
subspace of E containing u(E). Let 

Fy =FOE, and F(x") = Fo/n"Fo. 


Then Fo is a direct summand of E,, and hence F(z”) is a direct summand of 
E(x"), and is a free submodule. Hence the determinant 


det(I — tu(x")) 
can be computed in F(z"). Since 
det(I — tu) = lim det(J — tu(x")), 
it follows that 


det(I — tu) = det(I — tup), 


where ur, is the restriction of u to F. Then (iv) follows, and the proposition 
is proved. 


By the estimate of (ii) in the proposition, we conclude that the power series 
det(/ — tu) represents an entire function of the variable t for values in K, or 
in any complete extension of K. In particular, det(J + u) is defined for all 
ué CCC(E, E). 

Corollary 1. Let u, v € CC(E, E). Then 

det((J — tu) — tv)) = det(I — tu) det(I — tv). 

Proof. After multiplying u, v by appropriate scalars, we may assume without 

loss of generality that |u| < 1 and |v| < 1. In that case, we view u, v as acting 


on Ey, and reduce mod 2", in which case the formula is true when u, v are 
replaced by u(x") and v(x") respectively. Taking the limit yields the corollary. 


Corollary 2. Let u€ CC(E, F) and v € L(F, E). Then 
det(UI — tuc v) = det(I — tvo u). 


Proof. By (iii) and (iv) we may assume that u, v have finite rank, in which 
case the assertion is standard by the linear algebra of finite dimensional 
spaces. 
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We define the trace tr(u) of an element u € CC(E, E) as the coefficient of 
—t in det(J — tu). If (a;;) is the matrix associated with u with respect to a 
Banach basis, then as usual, 


tr(u) — Y ais. 
This formula shows that |tr(u)| < |u|. 


Corollary 3. Assume that K has characteristic 0. Then 
«a t” 
det(I — tu) = exo( - ¥ tr(u”) ). 
m=1 m. 


Proof. The assertion is true if u has finite rank by ordinary linear algebra. 
Let {u,} be a sequence of such endomorphisms converging to u. Then both 
the right-hand side and left-hand side with u replaced by u, converge to the 
corresponding expressions of the corollary, thus establishing their equality. 


Note. The above three corollaries are included to show how to apply 
Proposition 5.5. They were not needed in the applications of the preceding 
sections. Only the following proposition was needed. 


Proposition 5.6. Let 
0- E'> E-E"->0 


be an exact sequence of Banach spaces, and let u', u, u" be continuous linear 
maps making the diagram commutative: 


0 > EB’ > E—> E”"— 0 


| | 4 


0 > E’ > E > E” +0 


If u is completely continuous, so are u' and u", and we have 
det(I — tu) = det(I — tu')det(I — tu”), 
tru=tru’t+ tru’. 


Proof. We may view E' as a subspace of E, and wu’ as the restriction of u 
to E’. Thus u” is the map induced on E” as factor space E/E’. By the Corollary 
of Proposition 5.2 there exists a Banach basis {e;} for E’ and a Banach basis 
{e/} for a complementary subspace such that the images of {e’ }in E/E’ form 
a Banach basis. Then {e/, e7} forms a Banach basis for E. From this it is clear 
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that if u is completely continuous, so are u’ and u”. Reducing mod 2" yields the 
desired identity for u(x"), u'(x"), and u"(x"), whence the identity as in the 
proposition for u, u’, u”. The identity for the trace follows from that of the 
determinant since 


det(1 — tu) = 1 —(tru)t+-:-. 


This concludes the list of properties of the determinant which has been 
used in the Dwork theory. 
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Gauss Sums and the 
Artin—-Schreier Curve 


In this chapter we establish the connection between the spaces used to 
represent the Frobenius endomorphism, and eigenspaces for a Galois group 
of automorphisms of the Artin-Schreier curve. This connects Dwork theory, 
the Washnitzer-Monsky theory, and the fact realized by Monsky that the 
series E(x) can be used to construct explicitly such eigenspaces. The first 
section lays the foundations for the special type of ring under consideration. 
After that we study the Artin-Schreier equation and the Frobenius endo- 
morphism. 


§1. Power Series with Growth Conditions 


We let: 
R = discrete valuation ring of characteristic 0, complete, 
with prime element z dividing the prime number p. 
K = quotient field of R. 
k = R/x = residue class field, which has characteristic p. 


R«€x» = set of power series 
(1) g(x) = Ya,x" witha,eR 
n=0 


such that there exists 6 > 0 for which 


ord, a, > 6n for all n sufficiently large. 
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It is clear that R€x)» is a ring, which will be called the Washnitzer-Monsky 
ring. Furthermore, the elements of R€x» are precisely the power series 
which converge (absolutely) on a disc of radius > 1. Of course, the radius 
depends on the power series, there is no uniformity required. Later, we shall 
deal with the Dwork spaces, taking the uniformity into account. Indeed, 
it will also be useful to consider power series such that a finite number of 
coefficients may lie in K. 

Pick 6 rational > 0, and let x denote any rational power of z, well defined 
up to a root of unity in the integral closure of R. If p(x) as above is an element 
of R&x>, then we may write 


(2) (x) = Yi b,(n°x)" 

with coefficients b, (possibly algebraic over R) which are z-integral for all 
but a finite number of n. Conversely, if a power series g(x) in R[[x]] has a 
representation as in (2) with coefficients b, whose denominators are bounded, 
then it is clear that g(x) € R€x). 


We let Rx), be the p-adic completion of R€x». Then R€x), is the ring 
of power series 


Via wager, 
such that a, > 0, under the sup norm of coefficients. To prove that this is 
the completion, we observe that this ring R€x), is p-adically complete, and 
that R€x)» is dense in it. Furthermore, the polynomial ring R[x] is dense, 
so that 
R«x), = RIX], 


is also the p-adic completion of R[x]. 
The ring Rx), still has the advantage that 


R«x), mod x = k[x], 


in other words, its reduction mod z is the polynomial ring. 
The following property is immediately verified. 


Let w € R€x) (respectively R€x),) be such that 
w = 0 mod xx. 


Then the geometric series gives an inverse for 1 — win R€x) (respectively 


R«x)»). 


Lemma 1.1. The ring R€x), is integrally closed. 
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Proof. An element (x) of R€x), not divisible by x can be written 
Q(x) = bo +++ + byxt + °°: 


such that b, is a unit, and b, = 0 mod z for n > d + 1. Then the Manin 
proof of the Weierstrass preparation theorem given for instance in 
Chapter 5, §2 applies to show that 


p(x) = P(x)u(x), 


where P(x) = x4 +--+ + cg € R[X] is a polynomial, and u(x) is a unit in 
R«€x»,. (In that reference, the crucial step occurs when we assert that t(f) 
is invertible, which is true in R€x),, by the remark made before the lemma.) 
This implies that the zeros of P(x) all lie in the unit disc in the algebraic 
closure of the quotient field of R. 

If the quotient ¢(x)/W(x) of elements in Rx), is integral over R€x),, 
then we use the Weierstrass preparation theorem to write 


A) ge POD sey 


Wx) Q(x) 


where P(x), Q(x) are polynomials, and u(x) is a unit in R€x),. Then without 
loss of generality, we may assume that u(x) = 1. Since elements of R€x), 
converge on the unit disc, if Q(x) does not divide P(x), we may evaluate the 
right-hand side at a zero of Q(x) which is not a zero of P(x) to make the right- 
hand side infinity. An elementary criterion for integrality then shows that 
o(x)/(x) cannot be integral over R€x),, which proves the lemma. 


Lemma 1.2. The ring R€x)» is algebraically closed in R€x},, and therefore 
it is integrally closed (in its quotient field). 


Proof. As pointed out to me by Dwork, this lemma can be viewed as a 
special case of a result in Dwork-Robba [Dw-Ro], Theorem 3.1.6. The proof 
given below was derived in collaboration with Dwork. 

Let A = R€x» and A, = R«x),. Let ye A, be algebraic over A, satis- 
fying a polynomial equation 


F(y) = 0 


with coefficients in A. By the x-topology, we mean the topology of formal 
power series (high powers of x are close to zero). Given a € A,, if a’ is x-close 
to a, then «’ is also p-close to «. 

Let yo be a polynomial, in R[x], which is x-close to y. Let || _ || be the 
Gauss norm on power series (sup norm of the coefficients) extended to the 


362 


§1. Power Series with Growth Conditions 


quotient field K(A). Then F’(yo) € A, and in particular is holomorphic on a 
disc of radius > 1. Then for yo sufficiently close to y, we have 


| F(Yo) 
F ‘(Yo)* 
Hence the Newton sequence 
_,, _ FOr) 
Yn +1 Yn F'(y,) 


converges in the Gauss norm in the completion of K(A), and in fact to y 
itself since A, is complete, if we pick y, sufficiently close to y (closer than any 
other root of F in the completion of K(A)). Cf. for instance my paper, “On 
quasi algebraic closure,” Ann. of Math. (1952) pp. 373-392; or also my 
Algebraic Number Theory, Chapter II. 

We shall now use another norm to see that the sequence converges to a 
holomorphic function on a disc of radius > 1 + «¢ for some ¢, from which 
relatively small sets have been deleted. 

To avoid introducing a new letter, let us view K as a subfield of C,. 
If t is a real number > 0, we let D(t, 0) be the closed disc of radius t around 
the origin in C,. Let a,,..., a, be elements of this disc, and let r,,...,1m_ > 0. 
We let 


B= B(t; r, a) = set obtained by deleting from D(t, 0) the union of the 
balls D(r;, «;) of radius r;, centered at a,. 


In the sequel, we assume that t > 1 andr {<i 
If H(x) is a rational function with no poles in a set B as above, then from 


its factorization into linear factors, we see that the norm 


|All = sup | H(x)| 


is defined. 
Lemma 1.2.1. For any rational function H holomorphic on B, we have 
|H || < | Alls. 


Proof. We factor the rational function into a product of a constant factor 
and linear factors of type 


1 . 
x-a and ——, with ae C,. 
x-a 
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The Gauss norm of x — a is max({1, |a|). We pick x € B to be a unit which 
is not congruent to any a mod m,. For such x we have 


|H(x)| < ||Hllp. 
and the lemma is then obvious. 


Lemma 1.2.2. Given a rational function H(x), with ||H|| < 1, there exists 
a set B(t; a, r) with t > 1 andr; < 1, such that 


|H\lp <1. 


Proof. Factor 


A(x) =c[]G — ax)" [] & — 6)” 


where |a;| < 1 and |b,| < 1. Then |c| < 1 because ||H|| < 1. Let N be the 
number of linear factors, counting multiplicities. Let s > 1 be such that 
s%c < 1. It will suffice to find B such that each factor of H has B-norm < s. 
We consider factors of four types: 


1—ax, (1—ax)', x—b, (x—b)7! 


with|a| < land|b| < 1. For factors of the first three types, it suffices to select 
the radius t of B to be <s and sufficiently small > 1. For the factor of last type, 
namely (x — b)~', it suffices to delete from B a disc of radius 1/t’, where 
t' <tand ¢’ is very close to t. This proves the sublemma. 


We now return to the proof of Lemma 1.2. We define: 


Hol(B) = completion of the ring of rational functions 
having no poles in B, under the B-norm. 


Since the Gauss norm is bounded by the B-norm, every Cauchy sequence 
for the B-norm is a Cauchy sequence for the Gauss norm. Consequently 
there is a natural injection of the completions 


Hol(B) as K(X)gauss: 
Observe that A, = R«x), is contained in the Gauss completion. 


Given our element y € A, algebraic over A, we first select a polynomial 
yo € R[x] as in the beginning of the proof, so that 


| F(yo) 
F'(¥o)” 
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is small, and in particular is < 1. We let H(x) = F(yo)/F’(yo)” be this rational 
function. We then select B as in Lemma 1.2.2 so that ||H||, < 1. Then the 
Newton sequence also converges to an element of Hol(B). This implies 
that y is the power series expansion on the closed disc of radius 1 of a 
holomorphic function on B. It is mow a matter of foundations of p-adic 
analytic functions that the power series for y also represents the analytic 
function on some disc of radius > 1. This implies that the coefficients of the 
power series y = g(x) tend to 0 like some geometric series, thereby proving 
Lemma 1.2. 

For a proof of the foundational fact we have just used, see for instance 
Amice [Am]. Note that the power series for y may converge only on a disc 
of radius smaller than the radius of the original set B. For instance, the 
holomorphic function may have a finite number of poles near but outside 
the circle of radius 1. The power series will converge only on a disc which 
does not contain these poles. The proof requires the p-adic analogue of the 
Mittag-Leffler theorem, in lieu of analytic continuation over the complex 
numbers. 

Let fo(Y) be an irreducible polynomial of degree d, with coefficients in 
k(x], leading coefficient 1. By a lifting of f we mean a polynomial f(Y) in 
R«€x»[Y] of the same degree, leading coefficient 1, such that 


fo = f mod xz. 


Then f is necessarily irreducible over R€x),. Indeed, the coefficients in a 
factor are integral over R&x), so in R¢x» by Lemmas 1.1 and 1.2. Such a 
factor then reduces to a factor of fo. 

Let yo be a root of fp and let y be a root of f. Let 


Af = R€x>[y], and Ao = k[x, yo]. 
Then there is a unique homomorphism 
A > KX, Yo] = Lo 


reducing R¢€x» mod z, and sending y to yo. This is a standard fact of ele- 
mentary field theory. Thus the ideal () in R€x» extends uniquely to a prime 
ideal of , and 


AG. = A modn. 


We view / as defining an affine curve V and fo as defining its reduction 
mod z. Thus we shall write 


B= AV) and A/n = Ao(Vo). 


Then (Vo) is the ordinary affine ring of V. over the field k = R/z, but 
x = WV) is a more complicated ring, arising from the work of Washnitzer 
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and Monsky, following Dwork. We shall also say that V is a lifting of Vo, 
corresponding to the lifting f of fo. 


Lemma 1.3. Let we. and assume w =0 mod za. Then the geometric 
series 


l+wt+w+--- 
converges to an inverse of 1 — win &. 


Proof. For convenience, assume that the lifted polynomial f(Y) has 
coefficients in R[x]. This is all that we need in the applications, and the proof 
is slightly easier in this case. We write 

d-1 


w= >; gilx)y" 
i=0 


with g,(x) € R€x». Then for some 6 > 0 we also have 
d-1 : 
w= Y h(x) (nyy, 
i=0 


where h; are power series with coefficients in the algebraic closure of R, 
and all of these coefficients are divisible by the small power z*. Furthermore, 
for each positive integer n, we can write 


d-1 : 
y" = oY Qn, (x)y" 
i=0 


where ¢,,; is a polynomial of degree <n (<some constant times n, the 
constant depending only on f). It then follows at once that there exists ¢ 
such that if we write 


d-1 
we= DD epix’y 
i=Q j=0 


then ord c{”, > e(j + n). This proves the lemma. 


We shall say that Vo or fo is special if f (yo) * € kLx, yo]. It then follows 
that 


fy ted. 
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Indeed, let z ¢.oW be such that z = fo(yo) | mod x. Then 
2f'(y) =1+, 


where we. and w = 0 mod z. Applying Lemma 1.3 proves our assertion. 


Lemma 1.4. Assume that fo is special. Then © is integrally closed in 
R«x),[y], and also in the quotient field K of J. 


R«xy,Ly] 
SD = R«xy[y] R«x), 


R«&x) 


Proof. The powers y’ (j = 0,...,d — 1) forma basis of K over the quotient 
field of R&x. The dual basis with respect to the trace is contained in 
f(y) ‘&@, and hence in ». If an element z € R€x),[y] is integral over #, 
and we write 


d-1 


z= Lgioy! with g(x) € R&x),, 


then the coefficients g,(x) can be expressed as traces, 
g(x) = Tr(zy)), 


where {y}, ..., ya} is the dual basis. Since each zy; and its conjugates are 
integral over R€x», so is the trace, which lies in R€x» by Lemma 1.2. 
Hence z € R€x»[y]. The same argument shows that . is integrally closed. 
(All of this is standard elementary theory of fields and integral closure, as in 
algebraic number theory.) 


We shall need to lift roots of polynomials, as expressed in the next lemma. 
We let Q be the quotient field of R¢x». 


Lemma 1.5. Let V be a lifting of Vo as above, and assume that Vo is special. 
Let 


ft = R&xyLy] > kL, yo] 
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be reduction mod 1. To each root Z of fo in k[x, yo] there exists a unique 
root z of f in & such that 


zmod x = Z. 


If in addition k(x, yo) is Galois over k(x) with group Go, then Q(y) is Galois 
over Q. Say G is its Galois group. Then the map 


ot+o mod x 
gives an isomorphism of G with Go. 


Proof. Let z, € @ reduce to z mod z. Such z, exists because reduction 
mod z gives a surjective homomorphism of © onto k[x, yo]. We can find 
w € & such that 


wf (z1) = 1 mod x. 
We then define the usual sequence 


Zn+1 = Z_ — Wh(Zq)s 
We obtain 
S(2n+1) = fn) — Wh (Zn) fn) mod f(z,)?. 
It follows by induction that 
S(2,) =9 mod xn" and z, =z, mod z. 
The limit z of the sequence {z,} then a priori lies in the ring 
R«x),Ly], 


and is a root of f in that ring. By Lemma 1.4, we conclude that the root 
actually lies in R€x)[y] = &, as desired. 

Under the additional assumption of Galois extensions as stated, this proves 
that . is Galois over R€x». Since the prime z remains a prime ideal in .o~, 
by standard decomposition group arguments we see that G is the decom- 
position group of the prime, and we get the isomorphism of G with Gp by 
reduction mod z. Cf. Proposition 14, Chapter I, §5 of my Algebraic Number 
Theory. 


The last lemma is only a special case of a more general situation having 
to do with the possibility of lifting morphisms. What we use in connection 
with the Frobenius morphism is summarized in the next lemma. 
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Lemma 1.6. Let W,, Vo be two special affine curves, defined over k by poly- 
nomials go, fo respectively. Let W, V be liftings, defined by g, f. Let 


Po: Lo(Vo) > L(Wo) 
be a homomorphism of the affine rings, such that 


@o(x) € xk[x]. 
Then there exists a lifting homomorphism 
gp: A(V) > A(W) 


of Po. If p(x) € RK x) is a lifting of @o(x), then there is a unique choice of 
@(y) lifting @o(y) and making ¢@ a p-adically continuous homomorphism. 


Proof. Suppose (x) lifts @o(x). Then we can define a unique p-adically 
continuous homomorphism R¢x» > R«x» by 


Y a,x" Ya, P(x)". 


Let gf be the polynomial obtained by applying ¢ to the coefficients of f, and 
let (@f)o be its reduction mod z. Then (@f), has a root Z in W%,(W), namely 
Z = Po(Vo). Let z; € W(W) reduce to Z mod z. As in the proof of Lemma 1.5 
we then find w in » such that 


w(@f)'(z1) = 1(mod 7), 
and the same argument as in Lemma 1.5 then shows that z, can be uniquely 


refined to a root of of in (W). This concludes the proof. 


§2. The Artin-Schreier Equation 
The example we shall consider is given by the equation 
yee ie ae 


where N is a positive integer prime to p. If k is a field of characteristic p, then 
Y? — Y — x" is irreducible. (For instance, a root is ramified of order p over 
x = 0.) Furthermore, putting fo(Y) = Y’ - Y— x” (as a polynomial 
over k), we have 


fo(Y) = py? 2-1 =-1., 
Viewing now 
f(Y) = Yy? —- Y— x" 
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as a polynomial in characteristic 0, i.e. with coefficients in R, then for the root 
y of f(Y) we have 


f'Q) = py’? ~ 1. 


We see that fo (or the affine curve V)) is special as we defined it in the last 
section, and that it satisfies all the hypotheses of the lemmas, which are there- 
fore applicable. We call V, the Artin-Schreier curve. 

The equation Y? — Y — x% = 0 has a formal solution both in character- 
istic p and characteristic 0. Indeed, it has the approximate solution 
Y = Omod x, and since f’(0) = —1, the Newton sequence of approximate 
solutions converges in the formal power series to a unique solution h(x) 
such that h(O) = 0. In fact, 


W(x) = —xN +--+. 


In characteristic p, it has the group of automorphisms G,(p) isomorphic 
to Z/pZ, sending 


Yot? Vo + 4, for «ae Zp). 


By Lemma 1.5, there is a unique automorphism o, of # leaving R€x» 
fixed such that 


oy) = y + a(mod z). 
We let G(p) be the lifting of Go(p) as a group of automorphisms of .o~, as in 


Lemma 1.6. The map a+ «a, is an isomorphism. 
On the other hand, we have a lattice of rings: 


G(N) R«x)[y] Ge 


R«x*)[y] R«xy 


G(p) 
R«&x") G(N) 


From now on, assume that R contains the N-th roots of unity. Then R€x» 
is Galois over R&x‘ with group G(N) isomorphic to a cyclic group of 
order N, sending 


xe tx for le py. 


The two extensions corresponding to the two bottom sides of the parallelo- 
gram are linearly disjoint (being of relatively prime degree), and so opposite 
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sides of the parallelogram have isomorphic Galois groups. In particular, the 
ring 


& = R€xy[y] 
admits the group 
G = G(p) x G(N) 
as a group of automorphisms, whose fixed ring is R€x”». The fixed rings of 
G(p) and G(N) are R«xy and R¢x"»[y] respectively. 
The character group G consists of character (, y), where w is a character 


on Z(p) and x is a character which may be identified with a character on roots 
of unity. We write 


X= Xj 
ify =C,withO<j<N-1. 


If (W, v) is a character of G, and M is a G-module, we let M(y, x) be the 
eigenspace corresponding to this character. 


Lemma 2.1. We have (1, x;) = x/R€&x™), for0 <j<N-—-1. 
Proof. Obvious. 
We now let 2?-1 = —pand 

R = Zn, py]. 


In the previous chapter, §3, we had associated with each z such that 
n’~! = —pacharacter w, of Z(p), and the Dwork power series 


E,(X) = exp(a1X — 1X°). 
It follows from Lemma 2.2 of the preceding chapter that 


E,(y)€ R&y) < &. 


Lemma 2.2. The element E,(y) is an eigenvector of G(p) with eigencharacter 
y,,- In other words, for « € Z(p), 


OEY) = Wr(X)E,(y). 


Furthermore, E,(y) is a unit in R¢y). 
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Proof. For p odd, the inverse of E,(y) is E,(—y), which is thus also in 
R«¢y>. If p = 2, one has to apply Lemma 1.3. Next we verify that a, E,(y) 
and E,(y) differ by a p-th root of unity. We have: 


(o,E,(y))? = o,(E,(y)’) 
6, exp(pry — pry?) 
d, exp(— px’) 


exp(—mpa,x") = exp(—pmx") = E,(y)?. 


Hence 


o,E,(y) 
E,(y) 


EM,. 


The Galois theory of Lemma 1.5 applied to 


R«x*)[y] = R¢yy 


shows that a, is an automorphism of R¢y). Hence o, E,(y) is a power series 
in y, and in fact, 


5,E,(y) = E,(o,y) =1l+ TO, (mod nm’) 
1+ my + ma(mod 2”). 


On the other hand, 
E,(y) = 1 + ny(mod 7). 
Taking the quotient shows that 


OEY) _ 
boy 1 + ax(mod 77). 


This proves that the quotient on the left-hand side is equal to w,,(«), as desired. 


The units o, E,(y) will allow us to determine an eigenspace decomposition 
for J, except that we need p in the denominator of the orthogonal idem- 
potents of the group ring G(p). Hence we let K be the quotient field of R, 
so that 


K = R[1/p]. 
Then we shall abbreviate by 7, the ring 


Ay = A{\[p] = of @K. 
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For each « € Z(p) we suppose given a unit E, € such that E, is an eigen- 
vector for G(p) with eigenvalue w,(«). The family of units o,£,(y) forms 
one example, but there is another, equally natural, namely the family of units 

E,(y)' withi =0,...,p — 1. 


Since E,(y) has eigenvalue w,(1), it follows that E,(y)' has eigenvalue w,(1)'. 


Theorem 2.3. We have the eigenspace decomposition 


AK = () E,: R&x)x 


ae Z(p) 


or also 
bg = R&D ® @ ELROD. 


Proof. Note that x is free of dimension p over Rx»), x. Each eigenvector 
provides for a one-dimensional subspace, and hence their sum (necessarily 
direct) is the whole space 7x. 

The fact that Rx» is the fixed subring of G(p) implies that the eigenspace 
for the trivial character is precisely 


R«x)x. 
Recall that not only G(p) acts on & but also G(N), and so the group 
G = G(p) x G(N). 


A character of G(N) can be viewed as y; with 0 <j < N — 1, and x; is non- 
trivial if and only if1 <j<N— 1. 


Theorem 2.4. Let0 <i<p—1and0<j <N — 1. Then 
A Wa, Xj) = XIE Ay)’ RK «- 


Proof. Obvious from Lemma 2.1 and Lemma 2.2, and the fact that E,(y) 
is fixed under G(N). 


Remark. The use of the Dwork power series E,(X) to obtain eigenspace 
decompositions on spaces associated with the Artin-Schreier curve is due to 
Monsky, cf. [Dw 5], last section of the paper. 
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§3. Washnitzer-Monsky Cohomology 


Cohomology of differential forms with growth conditions on the coefficients 
was considered long ago by Dwork [Dw 1], [Dw 2]. The particular co- 
homology considered here is due to Washnitzer-Monsky [Wa-M], [M 1], 
[M 2], following the work of Dwork. 

The K-algebra ./, is finite separable over R(x. We have the ordinary 
differentiation d/dx on the power series in x. This differentiation extends in a 
unique way to &, because from the relation f(x, y) = 0 with coefficients 
in R, we get 


D, f(x, y) dx + Dz f(x, y) dy = 0, 


so 


Di f(% y) 


— ix. 
Dif y) 


dy = 
Here, if z € , then dz denotes the functional on derivations arising from 
the pairing 
(z, D) Dz. 


Since we are dealing with a special curve, we know that D, f(x, y) is invertible 
in «. Consequently the above formula expressing dy in terms of dx is valid 
over .. (For an elementary discussion of the foundations of the theory of 
derivations, cf. my Algebra, Chapter X, §7.) 

We let 


Q(x) = xf dx and QU ,) = Axl) = Hy dx 


be the spaces of l-forms. We let the Washnitzer-Monsky cohomology group 
be 


H(A) = H\(oA) = g/d. x. 


Note that the differential d acts like 0 on K, so 
d(cz)=cdz forzéeW,ceK. 


We are interested in the eigenspace decomposition of H}(.o/) with respect to 
G = G(p) x G(N). We note that the differential 


dx 


x 
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is invariant under G. The theorems of the last section give an eigenspace 
decomposition into three pieces: 


p-1N-1 


Ax = RKx)K ® @ E,(y)' ‘RKx”) x ® ® ® E,(y)ixIRKXNY x. 


These three pieces correspond to: 


(i) trivial action by G(p); 
(ii) trivial action by G(N); 
(iii) direct sum of (Wi, x;) with both wi and x '; non-trivial. 


In abbreviated notation, this direct sum can be written 


A= AR” @ AR) ® BHR, », 
v#1 


Y#1 
where .%"®) is the second piece in the above direct sum. Note that 
AY AE" = RK", 


whence the need for a subdivision of »§) into two smaller subspaces to 
be able to write the direct sum as above. 
We now want a similar decomposition for Q,. 


Lemma 3.1. We have: 


(i) NE” = R&xyx dx = LE” dx; 


(ii) QF” = ol sees ae Ox; 


(iii) for non-trivial (. x we have 


OW, 1) = hel 0 


Proof. Write a differential form as 
p-1 ; 
dX gix)y' dx. 
i=0 


Invariance under G(p) implies that g; = 0 if i > 1, and conversely, so (i) is 
clear. If ¢ is a primitive N-th root of unity, then d({x) = Cdx. Hence invariance 
under G(N) implies that 


Cg(¢x) = g(x) fori=0,...,p—1, 
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or equivalently 

gi(Cx) = 6 *gi(X). 
If N = 1 then assertion (ii) is clear. If N > 1, then this last relation implies 
that g,(x) has no constant term, and that in the power series expansion, all 


terms are zero except those involving x” with n = — 1 (mod N). Then (ii) is 
also clear in this case. For (iii), we write an element of Q,(, x) in the form 


g(x, y) “ 
with g(x, y) € Wx. Since dx/x is invariant under G. it follows that 
G(x, YE A R(W, 7), 
and g(x, y) is divisible by x because j > 1 if x = x;, so the lemma is proved. 
Lemma 3.2. The differential operator d maps: 


d: AE) ery ag”) 


d: AEN) + OEM) 
dx 
d: AY, ya) ag AW, v4) x = OY, » for w # I, x # 1. 
Proof. The first inclusion is clear. For the other two, we have 


dE,(y) 
E,(y) 


= d(ny — my”) = d(—1x") = —nNx%~! dx. 


Since yt > dg/g is homomorphic, we get 


(1) ay = ~innx® 


Using the rule for the derivative of a product, we then also easily find for 
any (x) € R&x)»: 


(2) dE ,(y)x!g(x*)) = E,(y)'x(Dj, QP) ON)N = 


where D,,; is the differential operator on a power series (x) given by 


d Z 
Dyy = x= — inx +. 
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We shall work with i = 1, and we thus let 


This proves the lemma, and in addition gives explicit formulas for the 
differentials, summarized by the following commutative diagram. 


R&x)x —> XE,(y)RKX*) 


np, | |: 


REY > VERO S 


The horizontal map on top sends 
(x) + XIE,(y) p(x), 
and similarly on the bottom, with dx/x on the right-hand side. 
Recall from Theorem 2.4 that for 1 < j < N — 1, we have 
A Was Lj) = EAy)RKX™) x. 
Theorem 3.3. 


Gi) For W # 1 and x # 1 we have 


d 
HAN 1) = Aes D / df xl, 1). 
(ii) For 1 <j < N — 1, we have an isomorphism 
R&xx/D;R&XYx > H'(w,, %) 
given by 
; ny aX 
9(x) > XE, (y)o(x Nea 


Proof. The first assertion is clear from Lemma 3.2. The second comes 
from the above diagram and Lemma 3.2. 
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The affine ring Z[x, y] has a natural embedding in the power series ring, 


Z[x, y] > Z[[x]], 


mapping y on a uniquely determined power series y = h(x), such that 
h(O) = 0. We have already mentioned this formal solution, and the fact that 


h(x) = —x" 4. 


Since f(Y) is irreducible over R€x» and its quotient field (for instance 
by the Galois theory), the root h(x) of f(Y) in R[[x]] gives rise to the em- 
bedding 


ff = R€x>[y] > RID]], 
sending y on h(x). This gives rise to a natural homomorphism 
H(A x) ——> H*(REEx]]x) 


Ay Ax/dt_ REx] x 4x/dRED JI. 
Theorem 3.4. We have an isomorphism 
H(A) (W, Xj) > Ay, 
where H,_,, is the representation space of the last chapter. 


Proof. Clear from Theorem 3.3. 


§4. The Frobenius Endomorphism 


If g is a lifting of a morphism @, of special varieties, then we let p* = H}(@) 
be the induced homomorphism on the cohomology groups. It can be shown 
that p* is independent of the lifting, but we shall not need this here. 

We shall deal especially with automorphisms of the Artin—Schreier curve 


o = (6,,0;) withae Z(p) and € py. 


Such o€G = G(p) x G(N) is an automorphism of # = R«x)[y]. We 
have 


o* = (of, of). 


378 


§4. The Frobenius Endomorphism 


We also have the Frobenius endomorphism 
Fo: L(Vo) + Lo(Vo) 


such that F(x, yo) = (x?, y8) in characteristic p. By Lemma 1.6 we know 
that it can be lifted uniquely to an endomorphism 


F: — of such that F(x) = x?, 


extending by K-linearity to an endomorphism of .% x. 
In characteristic p, that is on Vo, we obviously have 


(1) F (04, 0) = (0,, of) °F. 
Indeed, in characteristic p, (,, ,)(X, Yo) = (x, Yo + &), so 
Fo ° (o,; ox)(x, Yo) = (CPx?, yb + a?) 
= (CPx?, yh + a) 


= (0,, Of)(x?, y?) 
= (Ga; a?) 2, Fo(x, Yo). 
The commutation rule in characteristic zero follows by the uniqueness 


of liftings of homomorphisms when the x-value is prescribed. 
From (1) we get on the cohomology 


(1*) (04, 0;)* 0 F* = F* (a, af)*. 
In particular, on an eigenspace we find that for W ¥ 1, y ¥ 1, 
F*: HRW, x) > AR, x”). 


Now we wish to see what happens to the Frobenius endomorphism under 
the embedding of in R[[x]]. Mutatis mutandis, we know that h(x?) is 
the unique solution in power series with zero constant term of the equation 


T? — T = x, 


Theorem 3.4 now shows that the representation of F* on the distinguished 
elements 


; dx 
XE (y) = 
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corresponds to the representation on the elements 
, dx 
x/ exp(—2x%) — 
x 


arising from the last chapter. Thus we find: 


Theorem 4.1. The eigenvalue of F* on H'(.,)(W,, 7) is the same as the 
eigenvalue of Df on H, x- 
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The Stickelberger theorem giving the factorization of Gauss sums, the Gross- 
Koblitz formula, and the Davenport—Hasse distribution relations will 
be combined to interpret Gauss sums as universal odd distributions 
(Yamamoto’s theorem). 

On the other hand, Diamond [Di 1] and Morita [Mo] gave a value of 
L,(0, x) in terms of the p-adic gamma function. Ferrero—-Greenberg gave a 
variation of Diamond’s formula, and used it to show that L{(0, x) # 0 
under the appropriate conditions. The value L;(0, x) is essentially the gener- 
ator for the y-eigenspace of the Stickelberger distribution, but the proof of the 
non-vanishing requires the analogue of Baker’s theorem (Brumer in the p-adic 
case), as in the proof of the non-vanishing of the p-adic regulator of cyclotomic 
fields. This is combined with the linear algebra of distribution relations, 
especially in the composite case. Cf. Kubert-Lang [KL 5]. 

The formula for L;(0, x) will be derived by an elegant method of 
Washington [Wa 3], who gives the p-adic analogue of the partial zeta 
functions. Over the complex numbers, the coefficients in the expansion at 
s = | are themselves interesting functions (of gamma type), which appear 
thus in a natural way as homomorphic images of the partial zeta distributions. 
The same thing happens in the p-adic case. 


§1. The Universal Distribution 


We assume that the reader is acquainted with Chapter 2, §8, §9, §10. In 
particular, suppose that N > 1 is an integer. Let (Q/Z)y = (1/N)Z/Z, 
and let 


9:(Q/Z)n > A 
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be a function into some abelian group A. Such a function is called an ordinary 
distribution—distribution for short—if it satisfies the condition 


M-1 i 
—})=q(M 
¥ a(x i a) g(Mx) 
for every divisor M of N, and all x € (Q/Z)y. 
We let F(N) be the free abelian group generated by (Q/Z)y. We let 
DR(N) be the subgroup of distribution relations, that is, the subgroup 
generated by the elements of the form 


M-1 . 
y (s 4. i) — (Mx), for all M[N. 


i=0 


We let U(N) = F(N)/DR(N) be the factor group, which we call the universal 
distribution of level N. The natural map 


(Q/Z)y > UN) 
is then universal for distributions into abelian groups in the obvious sense. 


Given a distribution g on (Q/Z)y, there exists a unique homomorphism g, 
making the diagram commutative: 


U(N) 
wae 
(Q/Z)y |« 
™~ | 


Kubert’s theorem (Chapter 2, Theorem 9.2) asserts that U(N) is free on 
@(N) generators. 


Theorem 1.1. Let g be a distribution as above. Let K be a field of character- 
istic 0. Assume that the distribution obtained by following g with the natural 
homomorphism 


A>A®K 


has K-rank @(N), in the sense that the dimension of the vector space gener- 
ated by the image of (Q/Z)y has dimension @(N). Then g is the universal 
distribution. 


Proof. The rank of the image is at most #(N). If the vector space generated 
by the image has that rank, then the Kubert generators must remain free 
under g and the tensor product, so they must be linearly independent over Z 
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in the abelian group generated by g((Q/Z),). Hence the canonical homo- 
morphism from the universal distribution to g must be an isomorphism, 
as was to be shown. 


Let 
G(N) = Z(N)* 


be a group isomorphic to Z(N)*, the isomorphism being denoted by 


6.<°C. 


c 


We think of G(N) as the Galois group of Q(#y) over Q. Let 4 be an ordinary 
distribution, and let as before the Stickelberger distribution associated with h 
be defined by 


St,(x) = Stx) = Yo h(xcjoz'. 


ce Z(N)* 
If v is a character of G(N) with conductor m, we define 


S(xh) = Sms Mm) = Y xCedh(c/m). 


ceZ(m)* 


In fact, Theorem 1.1 can be made more precise, and again in Chapter 2, 
§8 we proved the following facts. Let M| N, and let y be a character of G(N). 
Let 


1 


ey = Teun Y Kojo, 


be the usual idempotent projecting on the y-eigenspace. 


ST 1. If cond x does not divide M, then 


i 
su( i) = 0. 


ST 2. If cond x divides M and has the same prime factors as M, then 


su( : Je, LQ)! S(% he, . 


M})**~ |G(M)| 


ST 3. If cond x divides M, and we let m = cond x, then 


I, RON ease 
su( mle: a |G(M)| ya XP))S% h)e,. 


p{m 
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An arbitrary value St,(a/M) for a prime to M comes from the formula 


a 1 
su( 4) == 05 su(i) 


so such values do not contain essentially more information on the image of 
the Stickelberger distribution than the normalized values St,(1/M). We 
suppose that h takes its values in an algebraically closed field K of character- 
istic 0, which in the applications is C or C, for some prime p. It would suffice 
to suppose that h takes values in a field containing enough roots of unity, but 
we can always extend scalars and still be able to use Theorem 1.1. The 
element S(y, h) is then an element of K. 


Theorem 1.2. Let h be a distribution with values in K. Let V be the vector 
space generated by the image of the Stickelberger distribution. Then 
e, V is generated by the single element S(x, h), and in particular has dimension 
0 or 1 according as that element is 0 or # 0. 


Proof. The proof of Theorem 8.2 in Chapter 2 in fact proves the statement 
as given here, although we stated previously only the corresponding di- 
mension property. 


Corollary. If S(7, h) # 0 for all x, then St, is universal. 


Proof. This follows just like Theorem 1.1, but we don’t even need to tensor 
with K since the values of the distribution h are already in K, and the values 
of the Stickelberger distribution are already in a vector space over K. 


A distribution h is called odd or even according as 
h(—x) = —hA(x) or h(—x) = A(x). 


From now on we restrict ourselves to distributions whose values are in 
abelian groups without 2-torsion. This condition will not be repeated. Such 
a group may then be embedded in a group where multiplication by 2 is 
invertible. When that is the case, any distribution is uniquely expressible as 
an even distribution plus an odd distribution, in the usual manner. 

Theorem 1.1 then remains valid for odd (respectively even) universal 
distributions, except that the rank is then @(N)/2 for N > 3, which we assume. 

Likewise, in Theorem 1.2, if h is, say, an odd distribution, then St, is also 
odd, and is universal odd if S(¥, h) # 0 for all odd characters x. 


Example. Let 


h(x) = B,(Xx>) ifx #0 
h(O) = 0. 
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We call / the first Bernoulli distribution. As we saw in Chapter 2, Theorem 8.3, 
its associated Stickelberger distribution is the universal odd distribution. 
This comes back to the fact that 


By, = S(z%, h) #0 


for odd characters x. 


§2. The Gauss Sums as Universal Distributions 


In Chapter 2, §10 we already gave the Davenport—Hasse distribution relation. 
Another proof of this relation follows from the Gross-K oblitz formula, and 
is left as an exercise for the reader. Here we are concerned with showing to 
what extent the Gauss sums give the universal odd distribution. 

We let h(x) be the first Bernoulli distribution as mentioned at the end of 
the last section. We pick N = q — 1 where g = p’ for some odd prime p, and 
we have by definition 


St(x) = ¥ h(xe)az?. 


The sum is taken for c € Z(q — 1)*. Write 


Define 
gx) = (a, *)/(at- 9), 


Stickelberger’s theorem gives the ideal factorization of g(x), namely by 
Theorem 2.2 of Chapter 1, we know that 


(g(x)) = p*™. 


It is convenient typographically and otherwise to write this formally 
additively, and thus we call 


St(x)-p 
the associated divisor of g(x). We view at first 


g:(Q/Z),-1 > Cp 
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as a map into the multiplicative group of p-adic complex numbers. The 
distribution relation is then satisfied only with fudge factors. To get rid of 
them, we let 


P = {p, p''*} be the group generated by all roots of 
unity and fractional powers of p. 


We call P the pure group (with respect to the prime p). We may then compose 
g with the canonical homomorphism C} — C}/P to obtain a map 


Gp: (Q/Z)q— 1 ris Cp /P 


into the factor group, which is a distribution by the Davenport—Hasse 
relation. We call gp the Davenport—Hasse or Gauss sum distribution. Note that 
C*/P is uniquely divisible by 2. Recall that the p-adic logarithm has kernel 
equal precisely to the pure group P. Consequently we have a natural iso- 
morphism of distributions 


GSD 1. gp © log, g. 


The map 
St(x) St(x)-p 


is a homomorphism of the Stickelberger distribution (which we know is 
universal odd). The decomposition group D, of p consists of the powers of 
p mod N. If we let G(N) = Z(N)* under the notation 


aroa,, 


then the values of the homomorphic image above can be viewed as lying 
in the group ring 


QLG(N)/D,], 
and 


r-1 
St,(x)modD,= » ( y p'xe))or a 


ceZ(N)*/Dp \i=0 


Lemma 2.1. Let {m(x)} be a family of integers and let 
a= |] gay. 


Then « is pure if and only if div « = 0, and in that case « is a root of unity. 
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Proof. The absolute value of g(x) in the complex numbers is 1. Hence 
|a| = 1. If wis pure, this implies that a is a root of unity. Conversely, assume 
that div « = 0, so «is a unit. The conjugates of g(x) also have absolute value 1 
(themselves being of the same form as g(x)), and it is standard that a unit all 
of whose conjugates have absolute value 1 must be a root of unity. This proves 
the lemma. 


From the lemma, it follows that we have an isomorphism 


GSD 2. gp © St, mod D,. 


Note. Factoring out by D, corresponds to the obvious classical fact 
that the Gauss sums satisfy the relation 


u(x) = 2(x?), 
cf. Chapter 1, GS 4. In the present notation, this is written 
g(x) = g(px). 


Let x be an odd character of conductor d prime to p. Since the Gauss 
sum distribution is also odd, it follows that the function 


at x(a) log, (4) 


is even, for a € Z(d)*. Hence the function is defined on Z(d)*/+1. 


Theorem 2.2 (Ferrero—Greenberg). Let x be an odd character of conductor 
d. Assume that y(p) = 1. Then 


Y x(a) log, (5) # 0. 


ae Z(d)* 
The proof will be based on the following lemma. 


Lemma 2.3. There exists a family of integers {m(d')}, for divisors d’ of 
d, a’ # d, having the following property. Let 


ae su(3) + ¥ ma’ su(3). 


387 


17. Gauss Sums as Distributions 
Let R be a set of representatives for cosets of the group generated by D, 
and +1 in Z(d)*. Then the elements 
646; aeéR 

are linearly independent over Q as elements of Q(G(N)/D >]. 

Proof. As in Theorem 1.2 we look at the -eigenspace for odd characters 
w such that ¥(p) = 1 and cond yw divides d. It suffices to prove that we can 
choose the family {m(d’)} such that 


fe, #0 for all p. 


First we know from ST 2 that if cond y = d then 


1 
St, (3) ey # 0 


because B, , # 0. Let d,; > d, > --- be the other divisors of d, unequal to d. 
Pick a sequence of integers m, < m, < --- which is rapidly increasing. Then 
for any y we have 


1 
su(;)e + Y miu(s eo # 0. 


Indeed, suppose d, is the conductor of y. By ST 1 we conclude that the i-th 
term in the sum is 0 ifi > s. By ST 2 and the fact that B, , # 0 we know that 


1 
su(z)es # 0. 


If the sequence is selected increasing sufficiently fast, then this s-term 
dominates in the sum, which is therefore also not equal to 0, thus proving 
the lemma. 


We return to Theorem 2.2. Let 


1 he 
ool’) Te(@) 


with the family {m(d’)} chosen as in the lemma. By GSD 1, GSD 2 we con- 
clude that the elements 


log, ¢,% withaeR 
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are linearly independent over the rational numbers. By Baker’s theorem 
(in the p-adic case, Brumer) it follows that they are linearly independent over 
the algebraic numbers. Therefore 


Nha 2, x(a) log,(¢,%) = y x(a) log, (4). 


aéZ(d)*/{Dp, + 1} 


because 
a 
> x(a) log, o($) =0 


for each d’ # d since the conductor of x is d. This concludes the proof of 
Theorem 2.2. 


§3. The L-function at s = 0 


Let d be a positive integer prime to p (where p is an odd prime). Let x be a 
primitive even Dirichlet character with conductor d or dp. We take the values 
of x to be in C,. We let @ = a, be the Teichmuller character, and we define 
—n 


Xn = XO 


We know that 
n-1 1 
Ll — 1, x) = —(1 — Xap)" ') © Bn, xn 


For n = 1, we obtain 
LO, x) = —(1 — x1(P)) Bi, x 
Since x, is odd, we know that B, ,, # 0. Hence: 
L,(0, x) = 0 ifand only if x,(p) = 1. 


The next formula is a slight variation of a formula of Diamond [Di], 
but expressed in terms of the p-adic gamma function itself by Ferrero- 
Greenberg [Fe-Gr]. 


Theorem 3.1. Let y be an even character such that y, has conductor d. 
Then 


d 
L,(0, x) = d Xa(0) log, r,(;) + (1 — x1(p))B, y, 1og,(d). 
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We shall prove this formula in §4. Here we derive consequences. Indeed, 
the next theorem amounts to a p-adic analogue of Stark’s conjectures in a 
special case. Considerably more insight in this direction was provided by 


Gross [Gr]. 
As in the preceding section, let x = a/(q — 1) and let 


: qg—-1\7} 
P(x) = (a,’), g(x) = ooooo( 4) 


The formula of Theorem 3.1 involves the log of an analytic expression (the 
gamma function). We shall transform it so that it involves the log of an 
algebraic expression (a Gauss sum). 


Theorem 3.2. Let y be an even character such that y, has conductor d and 
such that x,(p) = 1. Let D, be the subgroup of Z(d)* generated by the 
powers of p. Then 


LOM= Y uO te, a5) # 0. 


ce Z(d)*/Dp 
Proof. By assumption the formula of Theorem 3.1 simplifies to 


c 
LO, va) = X1(c) log, r,(;) 
1 


c= 


Let d|(q — 1), q = p’ where r is the period of p mod d. Let G = Z(q — 1)*. 
Then 7, is defined modulo D, by assumption. Since 


PZ), _ z) = +1, 


it follows at once from the Gross—Koblitz formula that 
log, o(x) = ) log, V,(<p'x)). 
i=1 


In particular, 
log, p(px) = log, (x). 


The sum over c = 1,..., d in the formula for L;(0, x) is then written in the 
form 


Yn) ¥ los, r,(()) =e ae ioe, (5) 
i=0 


ce Z(d)*/Dp ce Z(d)*/Dp 
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where 
—1\7! 
g(x) = ots0(4) 


is the odd distribution of the preceding section. That L;,(0, x) # 0 is then the 
main result of §2. 


§4. The p-adic Partial Zeta Function 


In the complex case, the Hankel transform gives an analytic continuation 
of the partial zeta function to the whole plane. In the p-adic case, we shall 
give an analogue of this partial zeta function due to Washington [Wa 3], 
who also pointed out to me that his formula for the p-adic L-function im- 
mediately gives the value of the derivative at s = 0 in terms of the gamma- 
type functions. 

Let x eC} be such that x-' = Omodm,. Let se Z,. We define the 
Hurwitz—-Washington function 


H(s, x) = x (' 2 ‘xB, 


j=0 J 


where B; is the j-th Bernoulli number. Since the Bernoulli numbers have 
bounded denominator at p (Kummer and von Staudt congruences), it is 
easily shown that H(s, x) is holomorphic for s € Z,. For an integer k > 1, 
we find 


H1. H(i — k, x) = x7*B,(x), 


where B, is the k-th Bernoulli polynomial. This is immediate from the value 
of this polynomial, 


ae 3 ' 
B(X) = > (F)x, 
j=0 V 
which comes directly from the definitions in terms of the generating power 


series, product of e and t/(e' — 1). 
Let N be a positive integer divisible by p. For a € o* we therefore obtain 


a = a 
a(t oat k, i) = N*a 8,(<). 
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H 2. If f is a function on Z(N), then 


1 N-1 
i wm “flat ara k, i) —_ By 5 wa PX "By, pop 


a 


where (f  p)(x) = f (px) and 


By = NO S$ ron,((2)) 


This is immediate by taking the sum over all a = 0,..., N — 1 and sub- 
tracting the sum over a = py, with 0 < y < (N/p) — 1. It is convenient to 
use the following notation. Put 


M, f(x) = f(px), fr = fo™. 


Then H 2 can be written in the form 


H3 wy KOE ieee = {a- k-1M.) f, dE 
; N Q=1 Pk "NN p pJk ke 


pha 


The formula expresses the Bernoulli distribution in terms of H, giving the 
possibility of analytic continuation. 

If y is a Dirichlet character whose conductor divides N, then the preceding 
formula reads 


H4. — “ aar4Hi(1 —k, <) = (1 — x(p)p*” *) Be. x,- 


We now define the Hurwitz—Washington function in three variables, 
H(s;a, N) = y~s ~H — 
(s; a, N) Tas Ve (s. < 


for ae Z*, seZ,, and N equal to a positive integer divisible by p. Then 
H(s; a, N)i is again holomorphic in s except at s = 1. It is the p-adic partial 
zeta function, cf. [Wa 3]. 

One could take the relation of the next theorem as the definition of the 
p-adic L-function, and thus make the present chapter independent of 
Chapter 12. 
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Theorem 4.1. Let x be a Dirichlet character, and let N be any multiple 
of the conductor of x such that N is divisible by p. Then 


IN. 


L,(s, x) = ¥ x(a)H(s; a, N). 


a=1 
pla 


Proof. The left-hand side and the right-hand side have the same values 
at the negative integers, which are dense in Z,, and they are both holo- 
morphic, hence they coincide. 


We are here concerned with finding L‘(0, x). That we are dealing with 


a character x is basically irrelevant, and so for any function on Z(N) we now 
define 


N-1 
Ls. f) = ¥ f@H(s a, N). 


pha 


In finding the expansion at s = 0, we shall meet the Diamond function 
defined by the formula 


G,(x) = (x — 4) log,(x) — x + B; 


= ee, 
j=2 jG-D) 


cf. [Di 1]. This formula arises from the asymptotic expansion of the classical 
complex log gamma function. It converges p-adically for |x| > 1, so G,(x) 
is defined in that domain. We shall analyze later the relation between the 
Diamond function and the gamma function. 


Theorem 4.2. Let p # 2. Let N be a positive integer divisible by p and let f 
be a function on Z(N). Then 


N-1 a N-1 a 
L,(00, f) = xX f (06,(¢) + z f oe, (=) log,(N). 
pha pla 


If f = x is a Dirichlet character, then 


N-1 
L0.0= > 1(0G,( 2) + (1 ~ z1(0))Bi, », 1og,(N). 


pha 
Proof. The desired result is an immediate consequence of the next lemma. 
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Lemma 4.3. Let |a| > 1/p and let N be a positive integer divisible by p. Then 
the coefficient of s in H(s; a, N) is equal to 


co(a)~ '6,(z) + ofa) 'B, (;) log, (N). 


Proof. We have the expansions: 


cote age 
ts . 


<a>*~* = Ca>(1 — s log,<a> +--+) 


r (-1? 


If j> 2, peas te 
d C3 jG- 


Using the fact that B; = 0 for j odd, j > 1, we find that the coefficient of s 
in H(s; a, N) is 


zi A E — log,(a) + + log,(a) - x (“) a 


j=2 


a 1 


= oa*6,( 4) + w(a)7 (a 5) log,(N). 


This proves the lemma. 


Remark. In [Di 2], Diamond discusses the regularization of his function, 
giving rise to certain measures which are then related to the Bernoulli 
measures. See also Koblitz [Ko 1]. In the classical case, gamma-type 
functions appear as coefficients of partial zeta functions (Hurwitz functions), 
and we meet a similar phenomenon here. 


Next, we derive some functional equations. First, for the Washington 
function, we get for p odd: 


H5. H(s; a, N) = H(s; N — a, N). 


Proof. It suffices to prove the formula when s = 1 — k, k > 2 such that 
k =0 mod p — 1, because such integers are dense in Z,. But then the 
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formula is immediate from the fact that k is even (p is assumed odd), and the 
property 


B(1 — X) = (—1)"B,(X), 


which follows directly from the generating function for Bernoulli poly- 
nomials. 


Washington has also pointed out that one can give elegant proofs for 
the following properties of the Diamond function by using the formalism 
of the H-function. We assume p odd. 


G, 1. G,(1 — x) + G,(x) = 0. 


Proof. In Lemma 4.3 we found the coefficient of s in H(s; a, N). Using H5, 
and replacing a by N — a in this coefficient, we now see that 


This is true for any positive integer N divisible by p, and any p-unit a, thus 
proving the formula. 


G, 2. G,(—x) + G,(x) = —log,(x). 
Proof. Immediate from the power series expansion. 
G, 3. G,(1 + x) — G,(x) = log,(x). 
Proof. Immediate from the preceding two properties. 


Theorem 4.4. Extend G,(x) to Q, by putting G,(x) = 0 if x€Z,. Then 
for all x € Z, we have 


Pol (x +6 
» o,( F ) = log, I’,(x). 
=0 


Proof. Both sides are continuous and satisfy the functional equation 
f(x + 1) = f(x) + d(x)log, x, 
where 6(x) = 0 if x = 0 mod p, and 6(x) = 1 otherwise. This is true for 
log, I’,(x) directly from the definition of I’,, and is true for the other side by 


G, 3. Hence the two functions differ by a constant. Putting x = 0 gives 0 
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on the right-hand side. By G, 1 we conclude that the left-hand side is also 
equal to 0. This proves the theorem. 


Theorem 4.5. Let y be a Dirichlet character such that the conductor d of 
X, is not divisible by p. Then 
d-1 - 
L,(0, va) — > x1(c) log, r,(5) aE a — 1(P)) Bi, y, log,(d). 
c=1 


Proof. Let N = pd. In Theorem 4.2, write 
a=c+bd 


with 1 <c<d—tand0 <b < p-—1. Then y,(a) = x,(c). If a is divisible 
by p then a/N € Z,,so that G,(a/N) = 0 in Theorem 4.4. The desired formula 
is then a direct consequence of Theorem 4.2 combined with the relation of 
Theorem 4.4, and the fact that log,(p) = 0. 


The formula of Theorem 4.2 is due to Diamond. The argument used to 
derive the variation in Theorem 4.5 is in Ferrero—Greenberg [Fe-Gr]. 


For the record, we state the distribution relation for the Diamond function. 


G, 4. For any positive integer m and |x| > 1, we have 


2,2 . “) = G(x) ~ (& — 9) log,(m), 


In particular, if m = p” is a power of p, then: 


pr-1 
G, 5. y o,(*5") = G(x). 

a=0 p 
This is a special case of G, 4 because log,(p) = 0. The proof of G, 4 can easily 
be given following a similar argument to that of Theorem 4.4, using the 
analyticity of G,(x) for |x| > 1. Our intent was to deal mostly with the 
cyclotomic applications, and we don’t go into a systematic treatment of these 
p-adic functions. 


396 


Appendix by Karl Rubin: 
The Main Conjecture 


Introduction 


In [Th], Thaine introduced a new method for studying ideal class groups 
of real cyclotomic fields using cyclotomic units. Recently, in [Kol], 
Kolyvagin developed a remarkable strengthening of this method, an in- 
ductive procedure which has Thaine’s method as its first step. If p is an 
odd prime, Kolyvagin was able to determine the orders of the different 
eigenspaces of the p-part of the ideal class group of Q(u,), decomposed 
with respect to characters of Gal(Q(u,)/Q) (see Theorems 4.2 and 8.8). 
These results were already known from the work of Mazur and Wiles 
[M—W], but Kolyvagin’s proof is very much simpler. 

In this appendix we give an exposition of Kolyvagin’s results on ideal 
class groups and extend Kolyvagin’s methods, using ideas from [Ru], to 
prove the full Mazur—Wiles theorem (the “main conjecture”) for Q(n,=) 
(see §§5 and 8 for the precise statements). In §§1—7 we concentrate on 
cyclotomic units and ideal class groups of real cyclotomic fields. Then in 
§8 we combine these results with the Kummer duality of Chapter 6 to 
study “minus” ideal class groups. It is also possible, as in [Kol], to do a 
direct analysis of the minus class groups using Gauss sums instead of 
cyclotomic units. 

The appendix relies only on Chapters 1 through 7. 


§1. Setting and Notation 


Fix a positive integer m and let F = Q(n,,)*. Let Y denote the set of 
positive squarefree integers divisible only by primes | = +1 (mod m) (that 
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is, by those | which split completely in F/Q). For every re # write 


G, = Gal(F(n,)/F) + Gal(Q(n,)/Q) 


and write N, for the norm operator 


N,= ¥, te Z(G]. 


r 
teG, 


We will often use additive notation for the multiplicative group F(p,)*. 
There is a natural isomorphism G, = [|] G, (product over primes / dividing 
r), and ue 


N, = I N, € Z[G,]. 


If 1 = +1 (mod m) and | Jr, then we will identify G, with Gal(F(p,,)/F(n,)), 
and we will write Fr, for the Frobenius of | in G,, the automorphism 
which sends each r-th root of unity to its -th power. For every prime 
1 = +1 (mod ™m) fix a generator o, of G, (which is cyclic of order | — 1) 
and define 


1-2 
D, = ¥. iofe Z(G. 
i=1 


This “operator” is constructed to satisfy the identity 
(1) (,— )D,=(1—-1I)—N, 
in Z[G,]. For re & define 


D, = I D, ¢ Z[G,]. 


Fix a primitive m-th root of unity ¢,, and, for each prime /= +1 
(mod m), a primitive [-th root of unity ¢,. For re # define 


b= (<n fs 2 1) (<a! []u- 1) 


These algebraic integers satisfy, for re Y and primes | |r: 


ES 1. &,¢ F(p,)*. 
ES 2. é, is a (cyclotomic) unit if r > 1. 
ES 3. Ni¢, = (Fr, — 1)e,p- 


ES 4. €, = &,,, modulo every prime above I. 
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Proof. The first two assertions are clear. ES 3 is the distribution rela- 
tion satisfied by the cyclotomic units (see CU 2, Chapter 6, §3) and ES 4 
is immediate from the fact that ¢, = 1 modulo all primes above 1. 


Fix an odd integer M (later M will be a large power of some chosen 
prime p); we will work “modulo M”. Let 


Sy = {re F:r is divisible only by primes | = 1 (mod M)}. 
§2. Properties of Kolyvagin’s “Euler System” 
Lemma 2.1. If re Sy, then Dé, € LF(p,)* /(F (p,)*)”@ 1. 


Proof. We prove this by induction on the number of primes dividing r. 
If r = 1 the statement is clear. If /|r, by ES 3 and (1) 


(a, — 1)D,¢, = (2 = 1) — N)Dyid, = 1 — Fr) Dyig, (mod(F(n,)*)'*). 
Since Fr, fixes F, our induction hypothesis shows 
(1 — Fr) Dyndon € (F (ayn). 
These o, generate G,, so this proves the lemma. 
Lemma 2.2. For every re Sy there is a x,€ F*/(F*)™ such that 
x, = D,¢, (mod(F(n,)“)). 
Proof. First observe that since M is odd and F is real, py F = {1}. 
Since r is prime to M, a simple ramification argument shows that 


Hy OF (p,) = {1} as well. 
Define a 1-cocycle c,: Gal(F(p,)/F) > F(p,)* by 


c,() = [(o eo 1)D,¢,] lec 


By Lemma 2.1, c,(o) € F(p,)*; it is uniquely defined because F(p,) has no 
nontrivial M-th roots of unity. Since H1(F(p,)/F, F(p,)*) = 0, there is a 
B € F(p,)* such that (o — 1)8 =c,(a) for all o. Define x, = D,é,/B™. Then 
k,éF”, and since B is unique modulo F*, x, is well-defined modulo 
(F*)”. This proves Lemma 2.2. 


Remark. We have obtained x, from D,é, via the sequence of 
isomorphisms 


LF (u,) “AF (u,) 1% © HY (F/F(H,), Bag) © Ht (E/E, Bay) © F* (EX 
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Each xk, gives a principal ideal of F (modulo M-th powers of ideals) 
which can be viewed as a relation in the ideal class group of F. These 
relations will be used to bound the size of the ideal class group. To 
do this, we must understand the prime factorizations of these ideals 
(Proposition 2.4 below) and also how to choose r so as to get useful 
relations (Theorem 3.1). 

Let @, denote the ring of integers of F, and write % =) Z/ for 


the group of fractional ideals of F, written additively. For every ra- 
tional prime / write 4 = p24, so = GY, and if ye F* let (yye 
All i 


denote the principal ideal generated by y and (y),€-4, [y]e-4/M¥#% and 
Ly], € -4/M.4, the projections of (y). Note that [y] and [y], are also well 
defined for ye F*/(F*)™. Write G = Gal(F/Q). 


Lemma 2.3. Suppose | splits completely in F and |= 1 (mod M). There 
is a unique G-equivariant surjection 


Q: (Op/lOp)* + 4/MF, 


which makes the following diagram commute: 


x71 va As 


(O,/lOp)* 4I,/M4, 


(For each i of F above | and 1' of F(p,) above 4, we have identified 
Orqy/A with Op/2) 


Proof. Since [F(p,):F] =!—1 and all primes above / are totally, 
tamely ramified in F(p,)/F, the vertical maps are both surjective and the 
kernel of the left-hand map, namely the subgroup 


{x © F(p,)*: x has order divisible by / — 1 at all primes above /}, 


is clearly contained in the kernel of the right-hand map. This proves the 
lemma. 


For | as in Lemma 2.3 we will also write @, for the induced 
homomorphism 


or ty € FX (F*)" : Ly] = 0} > 4/M4,. 
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Proposition 2.4 (Kolyvagin). Suppose r € Sy, and | is a rational prime. 


(i) If lyr, then [x,], = 0. 
(il) If Ir, then [k,], = @(K,)- 


Proof. By definition, x, = D,é, (mod(F(p,)*)”). If [}r, then all primes 
above | are unramified in F(p,)/F, so (i) follows from ES 2. Suppose |r, 
say r = Is. Recall that we can represent x, and x, by x, =D,é,/B™ and 
Ks = D,¢,/B3" where , € F(u,)* and B,€ F(p,)* satisfy 


(2) (© —1)8,=Co-—1)D,g1"" and = (6 — 1B, = [(o — 1)D,6,1"™. 
By (i) we may choose f, prime to I. 
In particular, the ideal of F(p,) generated by B™ is the lift of an ideal 


of F, so (since all primes above / are unramified in F(p,)/F(p,)) there is a 
y € F(p,)* such that B,y“"Y™ is a unit at all primes above J. Then 


[Niy], = Le, J)- 


Modulo any prime above /, using (2), (1), ES 3, and ES 4, 


(1 — o,)y"-?™ = (6, — 1B, = [(( — 1) — N) D,é,1"™ = D,g-viM 
l I r 1 sor [(Fr, — )D.] aT 


1-1)/M 
_ De 


= =(D M yi 1)/M_ 
Fe pg = Dee/B9 


Therefore, applying the diagram of Lemma 2.3 with yeF(p,)*, we 
conclude that 


Ly]: = @(K,). 


§3. An Application of the Chebotarey Theorem 


Fix a rational prime p > 2 and let C denote the p-part of the ideal class 
group of F. The next theorem together with Proposition 2.4 will enable 
us to construct all the relations we need in the ideal class group of F. In 
the simpler case where p} #(G) it already appears in the work of Thaine 
({Th], Proposition 4); the version below which we will need is essentially 
Theorem 5.5 of [Ru]. 


Theorem 3.1. Suppose one is given ce C, Me Z a power of p, a finite 
G-submodule W of F*/(F*)™, and a Galois-equivariant map 


w: W > (Z/MZ){G]. 
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Then there are infinitely many primes 4 of F such that 


(i) Aec. 
(ii) = 1 (mod M) and ! splits completely in F/Q, where | is the rational 
prime below A. 
(iii) [Ww], =0 for all we W, and there is a ue(Z/MZ)* such that 
~(w) = ub(w)d for all we W. 


Proof. Let H be the maximal unramified abelian p-extension of F, 
so that C is identified with Gal(H/F) by class field theory. Write 
F’ = F(pty,). We have the diagram below. 


Step I. FNH =F. 
The inertia group of p in Gal(F(p,,)/F) has index either 1 or 2, so 
there is no nontrivial unramified p-extension of F in F’. 


Step Il. F(W*™)NH = F. 
As in Chapter 6, Kummer theory gives a nondegenerate Gal(F’/Q)- 
equivariant pairing 


Gal(F(W"™)/F’) x W/W' > bn, 


where W’ is the kernel of the map from W into (F’)*/[(F’)*]™. Let t 
denote complex conjugation in Gal(F’/F). Then 7 acts trivially on W and 
by —1 on py, so t acts by —1 on Gal(F’(W"™™)/F’). Since H is abelian 
over F, t acts trivially on Gal(H/F) ~ Gal(HF'/F’). Therefore t acts on 
Gal(F’'(W'™)N HF'/F’) by both 1 and —1, so F'(W™”)N HF’ = F’ and 
step II follows from step I. 


Step Ill. F*/(F*)“ > (F’)* /((F’)*)™ is injective, so W’ = 0. 
By the inflation—restriction sequence of Galois cohomology, 


ker(F*/(F*)™ — (F')* ((F’)*)") = ker(H?*(F/F, ay) > H'(F'/F’, Ba)) 
= H'(F(Wy)/F, Ha): 
Since Gal(F’/F) is cyclic, 
#(H'(F(Wy)/F, Ba) = #(H°(F(Wy)/F, Ba) = #(Way(F)) = 1. 
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Step IV. Construction of A satisfying (i), (ii), and (iii). 
Combining step III with the Kummer pairing in step IJ, we conclude 


Gal(F'(W™™)/F’)  Hom(W, py). 


Fix a primitive M-th root of unity ¢, and define 7: (Z/MZ)[G] > py, by 
(1g) = Cy and i(g) = 1 for géG, g ¥ lg. Let ye Gal(F'(W'™)/F’) be the 
element corresponding to 10 ye Hom(W, py). Then by definition of the 
Kummer pairing, 1 0 W(w) = y(w'™)/w"™ for all we W. 

Since F’((W'”)QH =F we can choose 6¢€Gal(HF'(W'™)/F) such 
that 6 restricts to y on F’(W*™) and to ¢ on H. Let 4 be any prime 
of F of degree 1, unramified in HF’(W'™)/Q, whose Frobenius in 
Gal(HF'(W™)/F) is the conjugacy class of 6. Since W is finite, the 
Chebotarev theorem guarantees the existence of unfinitely many such A. 
We must verify that 1 satisfies (i), (11), and (iii). Let | be the rational prime 
below 2. 

The identification of C with Gal(H/F) sends the class of / to the 
Frobenius of 4, so (i) is immediate. Also, since 6 is trivial on F’, | splits 
completely in Q(p,,)/Q which proves (ii). The first assertion of (ili), that 
[w], = 0 for all we W, holds because / is unramified in F’(W1™)/F. 

From the definition of @,, ord,(g,(w)) = 0 if and only if w is an M-th 
power modulo 4. Also, 


ord,(W(w)A)=0 = 1o¥(w=1 = y(wlM)/wlM = 1 


<> wis an M-th power modulo J. 
Therefore there is a unit ue (Z/MZ)* such that 
ord ,(y,(w)) = u ord, (W(w)A) for all weW. 


It follows that the map w— @,(w) — uy(w)/ is a Gal(F/K)-equivariant 
homomorphism into @ (Z/MZ)J’, which has no nonzero Gal(F/K)- 


AN AA 
stable submodules. This proves (iii). 


§4. Example: The Ideal Class Group of Q(p,)* 


In this section we give Kolyvagin’s analysis of the ideal class group of 
Q(p,)*. Although not needed for the proof of the Mazur—Wiles theorem, 
this is the basic example of Kolyvagin’s method. 

We will apply the results of the previous sections with F = Q(p,)", 
where p is an odd prime. Let C denote the p-part of the ideal class group 
of F, E the group of global units of F, and & the subgroup of cyclotomic 
units (see Chapter 7, §5). For every character y of G = Gal(Q(n,)*/Q) 
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define the y-idempotent 


2 
=—— VF x0). 


e(x) = 

p—l ye 
If Y is a Z,[G]-module, we write Y(y)=e(y)Y. We will be interested 
in C(y) and (E/&)(x), where (E/&)(z) denotes the y-component of the 
p-Sylow subgroup of the finite group E/é. 


Theorem 4,1. For every character x of G, #(C(~))|#[(E/E&)(y)]. 
Prouf. We can assume x # 1, or else C(y) = (E/&)(z) = 0. Let 
M = #[(E/E)(WIA(CW))p, 


and let c,. ..., ¢, be any collection of ideal classes which generate C(y). 
We will choose inductively primes 4,, ..., 4, of F such that the class of 
A, is ¢, and the rational prime [; below 4; satisfies |; = 1 (mod M). 

Suppose 1 Sisk and we have chosen 4,, ..., 4;-, lying above the 
rational primes /,, ..., J;_; = 1 (mod M). We choose /; as follows. Write 
r, = []J, (so r, = 1) and let W be the subgroup of F*/(F*) generated 


by e(DK;,. Define ft; to be the largest divisor of M such that e(y)k,, € 
(F*)'*/(F*)”, and define a map W: W>(Z/MZ)[G] by W(e(y)k,,) = tie(%). 
Now apply Theorem 3.1 with c= c; and M, W, and w as above. Choose 
2, to be any prime of F satisfying Theorem 3.1 with this data, and let 
I, be the rational prime below /,;. Then /;=1 (mod M) and there is 
au;€(Z/MZ)”* such that @,,(e(y)K,,) = ujtie(Z)Ai- 

Let E’ (resp. &’) denote the image of E (resp. &) in F* /((F*)“. Then 
E‘(z) and &’(y) are cyclic, and (E/&)(y) = E’'(x)/€'(z). Recall 


Ky = (1 T C,)( ~~ Co) 

so e(y)K, generates &’(y). Thus t, = #[(E/&)(y)]. 

For each i > 1, by Proposition 2.4 the principal ideal [e(y)x,,] ¢ .%/M#% 
satisfies 

[ek,,,,] = (ex) + ¥ Lee...) 
j<i 

(3) =u,te(yA; in F#/(MY, e(y)A1,---, )A:-1)- 
Since e(y)K,. €(F*)*! we see that ¢,4,|t;. In particular ¢,,,|t; = 


#[(E/AQI, so (M/t;.,)C() =0 and we can divide (3) by ¢,;,, and 
project into C(y) to get 


(ti/tis1)¢; =O in C(y)Mcq,---, Ci-1)- 
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Thus since ¢,,..., ¢, generate C(y), 


k 
#(C(y) I] (ti/tisa) = ty /tear = FUE/E)\(DV/theas- 


Theorem 4.2 (Mazur--Wiles, Kolyvagin). For every character y of G, 


#(C(x)) = #L(E/E)(y)I. 


Proof. By the analytic class number formula (Theorem 5.1 of Chapter 3), 
I] #(C) = A(O) = 4(E/6)@ Z,] =[] #L(E/O)()1- 
x x 


The theorem follows immediately from this and Theorem 4.1. 


§5. The Main Conjecture 


Fix a rational prime p > 2, and for every integer n => 0 let 
K,, = Q(B n+), K, — U K,. 


Put 
A = Gal(Ko/Q) = (Z/pZ)* and Tl = Gal(K,,/Ko)  Z,; 


then Gal(K,,/Q) = A x I. For 1 <n<o write C, for the p-part of the 
ideal class group of K,, E, for the group of global units of K,, and 
é, for the group of cyclotomic units of K,. Write U, for the group of 
local units of the completion of K, above p which are congruent to 1 


modulo the maximal ideal, and let E, and V, denote the closures of 
E,0U, and &,9U,, respectively, in U,. We also define 


C,=limc,, E,=limE, V,=limV, and  U,=limU,, 
<— <— a <_ 


all inverse limits with respect to the norm maps. For nS let Q, be the 
maximal abelian p-extension of K, which is unramified outside of the 
prime above p, and write X, = Gal(Q,/K,). Define the Iwasawa algebra 


A =Z,{(0]] = lim Z,[Gal(K,/K)]. 
For every character y of A define the y-idempotent 


1 


ey) = at a x *(0)6. 
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If Y is a Z,[A]-module, we write Y(y) = e(y)Y. In particular, C,,(x), 
U,(~); E(, Vi(~, and X,,(y) are all finitely generated A-modules, C,,(z) 
is a torsion A-module, and if x is an even character X,,(y) and U,,(~)/V..(y) 
are torsion as well (§§5 and 6 of Chapter 5, §§2 and 5 of Chapter 7). 

Recall that two A-modules are said to be quasi-isomorphic if there is 
a map between them with finite kernel and cokernel. By Theorem 3.1 
of Chapter 5, every finitely generated torsion A-module Y is quasi- 
isomorphic to a module of the form Q)A/f,A for some f,;¢ A, and the 
characteristic ideal (|| f,)A is a well-defined invariant of Y which we will 
denote by char(Y). 

The following theorem is one of several equivalent forms of Iwasawa’s 
“main conjecture”, first proved by Mazur and Wiles in [M—W]. This 
theorem will be proved in §7. For a discussion of some of the other 
formulations see §8. 


Theorem 5.1. For all even characters x of A, 


char(C,,(x)) = char(E,.(x)/Vn.(z)). 


§6. Tools from Iwasawa Theory 


In using Proposition 2.4 and Theorem 3.1 to prove Theorem 5.1, we will 
need to know about the structure of C, and E, as Z,[Gal(K,,/Q)]- 
modules. For any fixed n we know very little, but Theorem 3.1 of Chapter 
5 (the classification theorem for A-modules) shows that C,, and E,, are 
well behaved as A-modules. In this section we relate C, and E, with C,, 
and E,,. The major results of this section (Theorems 6.1 and 6.3) are due 
to Iwasawa [Iw 12], although our proof of Theorem 6.3 is different from 
his. 

For every n let I, = Gal(K,/K,) and let I, denote the ideal of A 
generated by {y — 1: ye€TI,,}; if y is a generator of I then I, = (y?" — 1)A. 
Write 


A, = A/I,A © Z,[Gal(K,/Ko)]. 
If Y is a A-module we write 
Y,, = Y/I,Y = Y@ A,. 
We need to study the natural maps 


X(0r, ax XX) Cr, = Ci(%) U(r, a U,(%), 
E,Wr,7 Enl%, and Var, > Val) 


induced by the projection maps. 
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Theorem 6.1. For every x the natural map C,(y~)r,— C,(y) is an iso- 
morphism. If yx is even and x #1 then the natural maps 


Xo(Or, > Xl, — Unr, 7 Ul), = and V(r, > Val 


are isomorphisms. 


Proof. For C,(y) this is Theorem 4.1 of Chapter 5. For X,,(y) the 
proof is the same (but we now need xz # 1); see §6 of Chapter 5. For 
U,,(%) and V,,(y) this is Theorems 2.2 and 5.1 of Chapter 7, respectively. 


Lemma 6.2. 


(i) Suppose 07>W—>Y->Z-0 is an exact sequence of A-modules. 
For every n the kernel of the induced map W,,— ¥;. is a quotient 
of Z™. 

(ii) If Z is a finitely generated A-module and Zy,, is finite, then Z'™ is 
finite. 


Proof. Fix a generator y of IT. Applying the snake lemma to the 
diagram 

0 0 0 

0 ———> Wh ——_, yl™ 


i] 


0 ———> W ——-| — 


[er for 
> 


N 
a 


0 ———> W Y — Z ——— 0 
i Se TE ee Ee 
0 0 0 


proves (i). For (ii), if Zp, is finite then the right-hand column of the 
diagram shows that Z/Z'™ is quasi-isomorphic to Z. By Theorem 3.1 of 
Chapter 5 this is impossible unless Z'™ is finite. 
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Theorem 6.3. If y is even and y #1, then there is an ideal & of finite 
index in A such that for every n, annihilates the kernel and cokernel 
of the natural map E.,(%)r, > E, (x). In particular the kernel and cokernel 
are finite with order bounded independently of n. 


Proof. Consider the two diagrams with exact rows 


(U(0/ExO)r, —“— X.(0r, ——— C.(Wr, ———> 0 


TUE 


Cie S20 


and 
: —a_, r —— ~— —— 0 
0 ——_ E,() ———>_U",)_~————__U,,W)/E,(x) +~$-——> 0. 


The first one is induced by class field theory (Chapter 5, §5). Applying 
the snake lemma to the second diagram, since the map from U,,(z)r, to 
U,(y) is an isomorphism by Theorem 6.1, we see that coker(z,) ~ 
ker(zy,g). Since the map from X,(y)r, to X,(x) is injective by Theorem 
6.1, ker(zy)z) = ker(¢,). By Theorem 6.1, C.(y)r, * Gi”) is finite, so 
Lemma 6.2 shows that ker(¢,) is a quotient of C,,(Qsinites the maximal 
finite A-submodule of C,,(y). (Since C,, is finitely generated, C,,(Y)rinite 1S 
well defined and finite.) 
Similarly, ker(z,)  ker(¢,). Since 


#((U,(0/ExO)r,) S (Un) : En) # (ker(tty;x)) 
is finite for y #1 (by Leopoldt’s conjecture, Theorem 4.2 of Chapter 4), 
Lemma 6.2 shows that ker(¢,) is a quotient of (U,(~/Ex(X))rinites the 


maximal finite submodule of U,(y)/E,,(y). (In fact, one can show that 
(U,(~/E(WO)rinite = 0, but we do not need this.) Thus if we take 7 to be 


the annihilator in A of 
CyOrinite ® (U,((0/EalM)rinites 
& satisfies the conditions of the theorem. 


For every character y of A fix a generator h, € A of char(E,.(%)/V..(x)). 
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Corollary 6.4. Suppose xy is even and y #1. There is an ideal & of 
finite index in A such that for every n € oA and every n, there is a map 
6.9: En(~) > A, such that 6,.,(Vi(~) = nh,A,. 


Proof. Since U,,(y) is free of rank one over A (Theorem 2.1 of Chapter 
7) and 0 # E,,(y) < U,,(~), E.,{z) is torsion free and rank one. Therefore by 
the classification theorem (Chapter 5, Theorem 3.1) there is an injective 
homomorphism 0: E,,(y) > A with finite cokernel. 

We first claim that 6(V,(%))=h,A. Clearly 6 induces a quasi- 
isomorphism E,,()/V,.(y) > A/@(V,.(y)); since V,,(y) is free of rank one 
over A (Theorem 5.1, Chapter 7), 


A(V.()) = char(A/O(V,,(x))) = char(E..(7)/V..(%)) = h,A. 


Let © be an ideal of A satisfying Theorem 6.3. Fix an n and let 6, be 
the homomorphism from E,,(z)r, to A,, induced by 6, and z, the projection 
map from E,(z)r, to E,(~). For any n €. we can define non? E,(y) > A, 
so that the following diagram commutes: 


Tn " 
E(x) —go— An 


Er, ———> } 
Onin A 


ie., O,,,(u) = 8,(7, '(nu)). This is well defined because by Theorem 6.3, 
annihilates coker(z,,), and ker(z,,) is finite so ker(z,) < ker(6,). Then since 


V(X) = Tr(Vi(X)), 
Bn.n VilX)) > 78,(V..(x)) _ nh, Ay. 


This concludes the proof of Corollary 6.4. 


By the classification theorem, C,,(y) is quasi-isomorphic to a module of 
the form 


k 
D A/f,A 
with nonzero f,¢ A. In particular 
k 
writing f,= [|] Ff. we have char(C,,(y)) = f, A. 
i=1 


Corollary 6.5. Let f;, ..., f, be as above. There is an ideal B of finite 
index in A and for every n there are classes ¢,, ..., ¢,€C,(%) such 


409 


Appendix by Karl Rubin: The Main Conjecture 


that the annihilator Ann(c;)<c A, of c¢; in C,(/(Ane, 40°: + Anei-1) 
satisfies B Ann(c;) < f,A,- 


Proof. On torsion A-modules, the quasi-isomorphism relation is 
reflexive, so there is an exact sequence 


k 
0+ DAA > Col) > Z 0 

i=1 
with a finite A-module Z. By Theorem 6.1 and Lemma 6.2, tensoring 
with A, = A/I,A yields 

k 

Zn — >) A, /fiAn — Ch) aa Zr, = 0. 

i=1 

Let @ be the annihilator of the finite module Z and choose ¢; to be the 


image of 1 in the i-th summand A,/f,A, under the map above. The 
corollary now follows. 


Lemma 6.6. Let y be an even character of A, and let 


f,A =char(C,(y)) and hy, A = char(E..(9)/V,.(2) 


as above. 
(i) For every n, A,/f,A, and A,/h,A, are finite. 
(ii) There is a positive constant c such that for all n, 


1 HIG). “sc FEO) © 


. #(An/F An) ; #(A,/h,A,) ms 


IIA 


(iii) If x = 1 then f, and h, are units in A. 
Proof. First suppose y # 1. From a quasi-isomorphism 
k 
C,.(%) > 7) A/fiA 
we get, for every n, a map 
k 
CA) x Cor, — 7) Aa /fi:An 


with kernel and cokernel finite and bounded independently of n. In parti- 
cular each A,/f,A, is finite. It follows easily that A,/f,A, is finite for 
every n, and by Theorem 1.2 of Chapter 5, 


k 
#(An/fAn)/ # (@ Aufiba) 
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is bounded above and below independently of n (in fact, one can show 
that 


k 
#(A,/f,An) = # (@ Aulfide) 
This proves the first part of (i) and (ii). Similarly we have maps 


(E(O/ValX)r, > A,/h, Ay, 
(E(O/VelM)r, > EnQO/ Val 


with kernel and cokernel finite and bounded independently of n (using 
Theorems 6.1 and 6.3 for the second map). This gives the second part of 
(i) and (11), for y 4 1. 

Now suppose y= 1, and write Q, = K4. Then C,(z) = C4 is the p- 
part of the ideal class group of Q, for every n, and K4 = Q. Therefore 
C,(%) = 0 for every n by Theorem 4.3 of Chapter 5, and f, is a unit. 

Also when y=1, E,(y)= ES and V,(y)= VA. The analytic class 
number formula for the field Q, (the analogue of Theorem 5.1 of Chapter 
3, with the same proof), together with Leopoldt’s conjecture (Theorem 
4.2 of Chapter 4), shows that [E,(): V,(z)] = #(C,(~)) = 1 for every n. 
Therefore h, is a unit and the inequalities of (ii) hold with y = 1,c = 1. 


§7. Proof of Theorem 5.1 


We now come to the main result. For this section fix n and write 
C=C,, E=E,, and V = JV,. We will apply the results of §§2 and 3 with 
F = K,. Note that by Theorems 4.2 and 4.3 of Chapter 3, if y is even we 
can identify C,(z) with the y-component of the ideal class group of F. If | 
is a rational prime splitting completely in F and we F™%, recall that 
(w),€-4%, is the portion of the principal ideal (w) which is supported on 
the primes above / and [w],e.4/M.¥% its projection. If 1 is a prime of F 
above / then 4%(7) = e(y)(4 @ Z,) is free of rank one over A,, generated 
by A(x) = e(y)4, and we define v, =v, ,: FX >A, by v,(w)A(~) = e(y)(w),. 
We will write v, for the corresponding map from F*/((F*)” to A,/MA, 
satisfying V,(w)A(x) = e(~) [w]). 
Recall that for re A, we have x, e F*/(F*)™ as defined in §2. 


Lemma 7.1. Suppose ré Sy, I\r, and 2 is a prime of F above |. Let B 
be the subgroup of the ideal class group C generated by the primes of F 
dividing r/l. Write ce C(y) for the class of e(y)A and write W for 
the A,-submodule of F*/(F*)M generated by e(y)x,. If n, fe A, are 
such that the annihilator Ann(c)< A, of c¢ in C(x)/B(y)_ satisfies 
n Ann(c) < fA,, A,/fA, is finite, and 


M > #(C(QQ)#(AQ/M A(Q))/AnLe(OK, Li), 
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then there is a Galois-equivariant map ~: W > A,/MA,, such that 


Sle(W,) = nv, (k;,). 


Proof. Let B be any lift of e(y)x, to F*. Then 


e(x)(B) = v,(B)A(x) + y e(x)(B)q. 


By Proposition 2.4(i), (B),€ MY, if qtr. Since M annihilates C(x) we con- 
clude that v,(B)A(y) projects to 0 in C(xz)/B(x) and therefore nv,(f) € fA,. 
Write 5 = nv,(B)/f; division by f is uniquely defined because A,/fA, is 
finite. 

Define y: W>A,/MA,, by W(pe(y)k,) = po for all p € Z[Gal(K,,/Ko)]. 
This map has the desired property but we need to show that it is well 
defined. Suppose pe(y)k,=0, that is, pB =x™ with xeF*. Then in 
particular p[e(y)x,], = 0. Write h = #(C(y)). By our assumption on M, 
(M/h)(.4(~)/M.4()) < A, Le(x,],, so we must have p € hA,. Then 


e(X(x) = Y eq 
=M'e(x)(pBi + Y eH), + » he(z)(p/h)(M *(B)q) 


q\(r/1) 


= Me(nleph (mod ® 4400 hata) 


Since h annihilates C(y) we conclude that M~'e(y)(pf), projects to 0 
in C(y)/B(y). Thus M~'y,(pB)c = 0 in C(y)/B(y), so péf = nv,(pB) € MfA, 
and w(pe(z)K,) = pd € MA,,. This concludes the proof of the lemma. 


Recall char(E,,(x)/V,.(%)) = 4,A and char(C,,(y)) = f,A, where fy = il Si. 
i=1 


Theorem 7.2. For every even character x of A, char(C,,(z)) divides 
char(E..(x)/V..(x))- 


Proof. If x = 1, this is true because both characteristic ideals are trivial 
by Lemma 6.6. Thus we may assume x # 1. 

Recall that «, is represented by €=(C,,—1)(Cpn — l)eF*, and 
E(y) = €° generates V,(y). Let ¢,, ..., ¢,€C(y) be as in Corollary 6.5. 
Also choose one more class ¢,,, which can be any element of C(x) at all 
(for example, c,,; = 0). Fix an ideal @ of finite index in A satisfying both 
Corollaries 6.4 and 6.5, and let 7 € @ be any element such that A,,/7A,, is 
finite for all m (ie. for all m, 7 is prime to y”?” — 1, where y is a generator 
of T). Let @=6,,,: E(y) > A, be the map given by Corollary 6.4 with 
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this choice of 4; without loss of generality we can normalize @ so that 
A(C(x)) = nhy. 

Let h be any integer such that p" > #(A,/nA,) and p’ > #(A,/h,A,), 
which is finite by Lemma 6.6. Fix M = #(C(y))p"***". 

We will use Theorem 3.1 inductively to choose primes 4; of F lying 
above |; of Q for 1 <i<k +1 satisfying: 
(4) Ai E Cis I,=1 (mod M), 


(5) ¥,(«1,) = 44 nhy, Fi-1¥i,(Ky,) = uiNV;, ,(K,,_,) for 2<i<k+1, 


where r; = =H l, and u; €(Z/MZ)”*. 
For the first step take c=c,, W =(E/E™)(y), and 


W: W > E(Q/EQ™ 4 A,/MA, £> e(y)(Z/M Z)[Gal(F/Q)]. 


Let 4, be any prime satisfying Theorem 3.1 with this data, and /, the 
rational prime below /,. Then (i) and (ii) of Theorem 3.1 imply (4). By 
Theorem 3.1(iii) and Proposition 2.4(ii), for some u, €(Z/MZ)*, 


Va (K AL) = e(~) LK, Ji, = (x) Pi, (K1) = u,W(K,)Ai(y) 
=u, O(E(X))Ar(~) = win, A, (%).- 
Since (.4,/M.%,,)(z) is free over A,/MA,, generated by 4,(), this proves 
(5) for i= 1. 
Now suppose 2 <i<k-+1 and we have chosen 4,, ..., 4;_, satisfying 


(4) and be We will define /;. Let r_, = BO: By (5), vi,_,(k,,.,) divides 
ne hy 


#U(4,,./MA,_)/AnLK,, Ti, < #(An/1'*h,A,) S p™ 
Let W, be the A,-submodule of F*/(F*)™ generated by e(y)k,, ,- Using 


Corollary 6.5 and Lemma 6.6, Lemma 7.1 (with r=r,_,, 1=1,_,) yields 
a map y;: W,-> A,/MA,, such that 


Si Wile.) = NV, (Kp): 


Now choose 4; satisfying the conclusions of Theorem 3.1 with c¢ = ¢,, 
W=W, w= Bile and M as above. Then (i) and (ii) of Theorem 3.1 
imply (4) for i. By Proposition 2.4(ii) and Theorem 3.1(iii) there is a 
u; €(Z/MZ)* so that 

fi-1V1, (JAC) = fi-1e) [x], = fii Pi (e(WK,,_,) 
= fi-t uw(eOr,, A(X = un, (Kr, )Ai- 
This proves (5) for i. 
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Continue this induction process k +1 steps. Combining all of the 
relations (5) gives 


k 
nh = (L fi) en) in A,/MA,, 
i=1 


k 
for some ue(Z/MZ)*. Thus f, = II fi divides y**th, in A,/p"A,. This 


holds for every n, so f,|n**th, in A. 

To conclude the proof we need to remove the extra factor of n**?. 
Recall that @ is an ideal of finite index in A and y is any element of @ 
such that A,/nA,, is finite for every n. Thus we can choose y to be a 
power of p, and then use the Ferrero—Washington theorem (Theorem 2.3 
of Chapter 10) which says that f, is not divisible by p. However, one can 
also work without using the Ferrero—Washington result, by verifying that 
it is always possible to make two choices of 4 which are relatively prime. 
Since A is a unique factorization domain, it follows that f, divides h, and 
the proof is complete. 


Still writing f, = char(C,,(y)) and h, = char(E,,(x)/V..(x)), let 


f=, and h=||[h,. 


x 


We will show fA =AA, and it will then follow from Theorem 7.2 that 
f,A =h,A for every x. The proof will use the analytic class number 
formula. If a,, b, are two sequences of positive integers we will write 
a, ~ b, to mean that a,/b, is bounded above and below independently 
of n. 


Lemma 7.3. Suppose g,, 92 € A, 9ilg2, and #(A/g,A)r, ~ #(A/92A)r,- 
Then g,A = @2A. 


Proof. This is immediate from Theorem 1.2 of Chapter 5. 
Proof of Theorem 5.1. By Theorem 1.2 of Chapter 5 and Lemma 6.6, 


#(A/fNr, ~ T] FAA, ~ [] 4G.) = #(C,), 


#(A/hA)r, ~ T] #(A/A, Ade, ~ T] CEn@) : VO] = [En Vil. 
Xx x 
But the analytic class number formula (Chapter 3, Theorem 5.1) together 
with Theorem 4.2 of Chapter 4 shows that #(C,) = [E,:V,]. Therefore 


(A/fA)r, ~ (A/hA)p,. By Theorem 7.2, f\|h so by Lemma 7.3, fA =hA. 
Again using Theorem 7.2 we conclude that f,A = h,A for all x, ie. 


char(C,,(z)) = char(E.,(x)/V..(x))- 


414 


§8. Other Formulations and Consequences of the Main Conjecture 


§8. Other Formulations and Consequences of 
the Main Conjecture 


Write w,:F >1+ pZ, for the character giving the action of I on p,« 
and q@ for the Teichmuller character giving the action of A on p,. For 
every nontrivial even character x of A let g,¢ A be the p-adic L-function 
appearing in Theorem 5.2 of Chapter 7. Then g, is characterized by the 
values 


1 
(6) wk(g,) = Ll — ky) = —( - xo™(D)PE*) | Bk, xo-® 


for positive integers k, where we view yw“ as a primitive Dirichlet 


character. The following theorem is another form of Iwasawa’s main 
conjecture; it is weaker than the consequence of Vandiver’s conjecture 
mentioned at the end of Chapter 7. 


Theorem 8.1. For all nontrivial even characters y of A, 
char(X,.(%)) = g,A. 


Proof. By class field theory (see §5 of Chapter 5, especially Theorem 
5.1), there is an exact sequence 


0 > U(W/EnlX) > Xal(X) > ColX) > 9. 


Also we clearly have 


Since the characteristic ideal is multiplicative in exact sequences, 


char(X,,(x)) = char(U,,(%)/V..(X)) = 9,A 


by Theorem 5.1 of this Appendix and Theorem 5.2 of Chapter 7. 


We now turn our attention to odd characters y. Define A, = lim Cx 


If x is any odd character of A, then y~'@ is even and using the Kummer 
duality of Chapter 6 we can relate A,(y) with X,(y‘@). 


Theorem 8.2. For every odd character y of A, Kummer theory gives a 
nondegenerate pairing 


A(X) X Xa(¥7'@) > Bye. 


Proof. This is immediate from Theorems 2.2 and 2.3 of Chapter 6. The 
only thing to check is that with Q, defined as in §2 of Chapter 6, 
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Gal(Q,/K,,)(y ‘w) = 0. This can be seen easily by Kummer theory since 
xy 'w is an even character. (Caution on notation: the C,, of Chapter 6 is 
what we are calling A,, in this Appendix.) 


Define a “twisting” map 1: A> A by i(y) = w,(y)y for y eT. 
Corollary 8.3. For every odd character x # @ of A, 

char(Hom(A,,(z), Q,/Zp)) = (gy-10)A- 
Proof. Exercise, from Theorems 8.1 and 8.2. 


Remark. By the theorem of Iwasawa [Iw 12] mentioned in Chapter 6, 
§2, C(x) is quasi-isomorphic to Hom(A,,(z), Q,/Z,), so one can conclude 
from Corollary 8.3 the following result: 


Theorem 8.4. For every odd character xy # w of A, 
char(C,,(x)) = Ug,16)A- 


This was the form of Iwasawa’s main conjecture proved by Mazur and 
Wiles. 


Remark. When x = , C,,(y7) = 0 by Corollary 2, §3 of Chapter 1. 


As a final consequence of the main conjecture we wish to determine 
the orders of the groups C,(y) for odd characters y. This is the analogue 
of Theorem 4.2 which gave the orders of C)(y) for even y. Recall 
I, = Gal(K.,/K,). 


Proposition 8.5. For every odd character x # of A, A,(y)'" = C,(y). 


Proof. Since y is odd and y#a, E,(y) = {1} (see Theorem 4.1 of 
Chapter 3). It follows by Theorem 4.3(i), Chapter 6, that the maps 
C,(y) > C,,(y%) are injective for all m>n, and so C,(y)c A,(~)™. If y 
denotes any generator of I, we have an exact sequence 


O —> Cy)" —> Cal) > CX) — CmlO/y — 1) Cn) — 0. 
By Theorem 4.1(ii1) of Chapter 5, C,(~)/(y — DC.) = C,(y) and we 
conclude that #(C,(y)'") = #(C,()) for all m>n. Thus #(A,(y)'™) = 
#(C,(x)) and the proposition follows. 


Lemma 8.6. Suppose Y is a finitely-generated torsion A-module with no 
nonzero finite A-submodules, yeT, ae 1+ pZ, and Y/(y — a)Y is finite. 
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Then 
#(Y/(y — a)Y) = #(A/(char(Y), (y — a)A)). 


Proof. Fix a quasi-isomorphism Y > G)A/f,A with finite cokernel Z. 
Since Y has no finite submodules the kernel must be trivial, and char(Y) = 
[ | fA. We have an exact diagram 


0 0 0 
rere Yy-a) ——> D (ALFA) —a) ———9 Ze-a) 


—— Y.-S SOAS Se se += 3a 


Ir Ie i 


0 —— Y — DAFA - Z —— 0 


Y/(y — a) Y ———> DAMF, » — JA —— Z/(y —- aZ —— 0 


| 


0 0 


Here Y,-,, denotes the kernel of (y—a) on Y and similarly for 


(A/f;A)y—a)> Za). We see 
rankz, (@ (A/fiA)y—a) — rankz (Y)—-a)) = rankz (Y/() = a) Y) —0 


so each f; is prime to y—a and @(A/f,A),-4 = 0. Also #(Ziy-a)) = 
#(Z/(y — a)Z) since Z is finite, so by the snake lemma 


#(Y/(y — a) Y) = #(DBAMS, y — aA). 
It is left as an exercise to show that 
#(®D AMS, y — AA) = #(AM[] fi, y — aA). 


(Hint: If f, ge A and f is prime to y — a, then multiplication by f gives 
an isomorphism A/(g, y — a)A = (f, y — a)A/(fg, y — aA.) 
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The next result was proved by Iwasawa ([Iw 12], Theorem 18). 


Lemma 8.7. For every even character x of A, X.,(y) has no nonzero 
finite A-submodules. 


Proof. By Theorem 8.2 it will suffice to show that A,(y~'w) has no 
proper A-submodules of finite index. Suppose A < A,(z~'@) is stable 
under [ and #(A,(y 'w)/A) = p* is finite. Then for some sufficiently 
large N, Gal(K,,/Ky) acts trivially on A,(y~'w)/A. If m>N, since the 
norm map Nx, .jx,,: Cm+k > Cm is surjective (Chapter 5, §4), 


Cr(x‘@) = Niken/ Kn Cmee(X '@) cA. 
Thus A,(y 1@) < A, which proves the lemma. 


The following is a strengthening of Stickelberger’s theorem, Theorem 
3.1 of Chapter 1. 


Theorem 8.8 (Mazur—Wiles, Kolyvagin). For every odd character y # w 
of A, 


#(Co(x)) = p™, 
where m(z) = ord,(B,,,-1). 
Proof. By Theorem 8.2 and Proposition 8.5, 
Co(x) = Hom(X,.(4-7), Hp)’ = Hom(X,,(¢'@)/(y — ©.) X(K"@), Mp») 


where y is a generator of I. Thus by Lemmas 8.6 and 8.7, Theorem 8.1, 
and (6), 
#(Co(W) = #(Xalx OV — On(7)) Xa(X*@)) 
#(A/(Gy-10» Y — Oxl?))A) 
= #(Z,/0(9,-10) Zp) 
= #(Z,/B,,,-:Z,), 


Il 


which is the desired equality. 


Remark. Write @, for the Stickelberger ideal in Z,[Gal(K,/Q)] as 
defined in Chapter 2. Theorem 8.8 can be viewed as stating that for odd 
characters y 4 o, 


#(Co(x)) = [Z,[A] (x): Fol) 1. 
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Without much extra difficulty one can use Theorems 8.1 and 8.2 and 
Proposition 8.5 above to show that 


#(C,(x)) = [Z,[Gal(K,/Q)] (%) : F(x]. 


Further, using results from Chapters 2 and 4 one can prove for all odd 
yx # another result of Iwasawa 


lim (Z,[Gal(K,/Q)1/2,)(x) © A/(G,-10)A- 


Together with Theorem 8.4 this gives one more equivalent version of the 
main conjecture: 


Theorem 8.9. For all odd characters y # w of A, 


char(C,,(y)) = char (anim a) 
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